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Preface 


The essays in this book collect some of the lectures I have given at various 
places. Although the lectures ranged over a wide array of topics, they had a 
single theme which reflected my deep interest in the role of symmetry and 
supersymmetry in quantum theory. As such they are mathematical as well as 
personal. 

Symmetry and supersymmetry, especially in quantum theory, are expressed 
in the language of the theory of unitary representations. This is a subject 
of great intrinsic beauty and enters other parts of mathematics at a very 
deep level. Two of the greatest figures in its history are Mackey and Harish- 
Chandra. Their work (to use the words of Wey!) affords shade to large parts 
of present day mathematics and high energy physics. It is to their memory 
that this volume is lovingly dedicated. 

The essays should perhaps be viewed like a stroll through a garden of 
ideas: quantum algebras, super geometry, unitary supersymmetries, differen- 
tial equations, non-archimedean physics, are a few of the topics encountered 
along the way. My own mathematical education evolved out of interactions 
with Mackey and Harish-Chandra, and I conclude this volume with brief 
portraits of their work, embedded in the context of personal reminiscences. 

Quite a large number of people have had a hand in shaping my views 
expressed in these essays. In the essays themselves I have made an effort to 
mention some of them. But I must acknowledge my deep gratitude here to my 
longtime friend and collaborator Don Babbitt for the hundreds of discussions 
on the topics discussed here, spread literally over a lifetime. 

I am also indebted to Don Blasius, who, despite many demands, personal 
as well as professional, on his time, took a deep interest in these essays and 
made many suggestions for improvement. A first draft of a substantial part 
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of this book was written while my wife Veda and I were vacationing in the 
Dordogne, living in Don’s house in May of 2009. The delights of rural France 
were a big inspiration for me and so I should thank Don and Peter, as well as 
Barry and Julie (not to mention Sukie!), for making life so pleasant in Chez 
Blasius. 

Finally, | wish to thank Ann Kostant for the help and encouragement she 
has given me throughout this enterprise. It was her enthusiastic acceptance of 
my suggestion, made in the Spring of 2009, that Springer publish a book of 
essays collecting together some of the lectures I have given at various places, 
that was the genesis of this book. I really cannot thank her enough. 


Pacific Palisades 
Summer 2010 


Chapter 1 
Prologue 


The world is built with a great harmony but not always in 
the form which we expect before unveiling it. 
Goro Shimura 


Some thoughts on reality and its description, as well as some personal recollections. 


1.1 Reality and its description 
1.2 A quantum education and evolution 


1.1 Reality and its description 


I have always admired the profound insight behind the remark of Shimura 
quoted above. He was actually referring to the world of mathematics! but I 
have modified his remark to mean the physical world since it makes even more 
sense in that context. The essays that follow, which are essentially reworkings 
of lectures I had given at various places, give, among other things, a highly 
personal view of the quantum world that has evolved in the twentieth century 
as an offspring of the genius of many great physicists and mathematicians. My 
discussions are neither complete nor entirely objective, but there are aspects 
to them that I feel are rather compelling. This prologue is something like 
an introductory discussion that sets the stage for what is to come and gives 
a preview of the themes to be explored. I have mixed it with some personal 
reminiscences of how these ideas got sorted out in my own mind. 

It is generally agreed that it was Galileo Galilei who first insisted that 
the description of physical reality must be in the language of mathematics. 
However, the phrase physical reality in the above statement should not be 
taken in any absolute sense; I use the phrase to mean just what we are able 
to perceive under current circumstances. Thus it certainly has a provisional 
character, with a more comprehensive meaning to us than to Galileo. Based on 
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our past experience it appears that physical reality is layered, as our abilities to 
make observations improve, we peel off the existing layers to peer inside the 
new layer and try to come to an understanding of it. The new perceptions often 
force us to invent new mathematics to describe them, as well as new ways of 
setting up the dictionary between mathematics and phenomena, which may 
be completely different from the earlier ones. 

I want to emphasize that this is not a question of setting up an axiomatic 
scheme and showing that the new theories are just natural consequences of 
these axioms. Indeed there is “no functor in the sky” that will reveal the 
secrets of the physical world to the mathematician. Rather, it is the creation 
of a new way of interpreting phenomena that is mirrored in the mathematical 
models we propose. These models seem perfect till we penetrate to the next 
layer, when their inadequacy is revealed, and we are confronted with new 
problems and new dilemmas. 

Nothing illustrates this better than the transition from classical to quantum 
mechanics which is one of the most dramatic in the entire history of science. 
Many serious mathematicians and physicists believed, at the turn of the twen- 
tieth century, that the task of the natural philosopher was finished, and all that 
remained was the computation of the fundamental constants of nature to ever 
greater accuracy. But in just a short time, spectroscopic observations over- 
whelmed the ability of classical electromagnetic theory to explain and predict 
them. Indeed, according to the classical electromagnetic theory applied to the 
model where the electrons are circling around a nucleus, the moving electrons 
will radiate and continually lose energy; this would imply that the spectral 
lines would be continually shifting, and ultimately the electrons will fall into 
the nucleus. This is in stark contradiction to what is observed: the existence 
of sharp spectral lines. Thus the stability of matter could not be accounted for 
on the basis of classical electromagnetic theory. Clearly, deeper explanations 
were required. A great step was taken by Niels Bohr when he created what 
is now called the Bohr atom. For a certain period of time it was enough to 
work with this hybrid model where the classical structure was buttressed by a 
set of rules (quantum conditions) that took into account the new phenomena. 
After spectacular initial successes discrepancies began to appear and cloud 
the picture, as new phenomena could no longer be explained except by mak- 
ing artificial and ad hoc modifications of the theory of Bohr. Eventually it 
was recognized that new principles had to be introduced in describing atomic 
phenomena. The first step in this transition, which was indeed a gigantic one, 
namely the creation of a fundamentally new mechanics, was taken by Heisen- 
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berg. He started with the basic principle that the new mechanics should be 
based on only quantities that are physically observed. In the case of atomic 
transitions this meant to him that each physical quantity should be represented 
by an infinite matrix indexed by the energy levels £,, whose entries x,,;, are 
the complex amplitudes associated with the transition from E,, to E),. In an 
absolutely mysterious manner he realized that the algebra of matrices entered 
into the picture because he stipulated that if X is the matrix associated with 
the physical quantity x, then X” is the matrix corresponding to the quantity 
igus 2)... +). 

The idea that matrices represented physical quantities, with its mystic ori- 
gins in Heisenberg’s mind, introduced a strange new complication in the 
mathematics because of the fact that matrix multiplication is non commu- 
tative. Indeed, Heisenberg believed at first that this was a weakness of his 
theory. But it was Dirac who understood that instead of being a weakness 
it was the key to the description of the quantum world. To Dirac the set of 
physical quantities, which is represented in classical mechanics by an alge- 
bra of functions, and hence a commutative algebra, is represented in quantum 
mechanics by a non commutative algebra, such as an algebra of matrices. 
Dirac called this algebra the quantum algebra. He made the further remark- 
able discovery that in the quantum algebra the commutator ab — ba of two 
physical quantities a, b, really corresponded to the classical Poisson Bracket 
of the quantities interpreted classically, although there would be elements of 
the quantum algebra which had no classical analogs, such as spin. The struc- 
ture of the quantum algebra depended on hf, Planck’s constant, in such a way 
that when fi could be neglected, the quantum algebra became commutative. 

It must be noticed that the change in point of view from Heisenberg to 
Dirac, is dramatic and yet subtle. The Heisenberg matrices are very concrete 
and tied very intimately to very physical aspects: atomic transitions, energy 
levels, transition amplitudes, and so on. The Dirac algebra is very abstract, 
with a structure dependent on fi that becomes commutative when fi — 0. 
This extra abstraction became so characteristic of Dirac’s entire approach to 
the description of Nature that it eventually acquired a special name: the Dirac 
mode. 

If one looks back at the description of the pre quantum world, and compares 
it with the new ideas that were needed to describe the quantum world, tt 
becomes clear that what had happened was a revolution in thought, in the 
way we describe the physical world, and in the dictlonary we set up between 
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the abstract concepts and concrete aspects of the world. This is what I meant 
when I remarked earlier that we had penetrated to a new layer of reality. 

The fact that in the quantum description old pictures based on spacetime 
such as electron orbits were thrown out made the connection between reality 
and the mathematics that described it very remote from intuition and expe- 
rience. The consistent physical interpretation of the mathematical schemes 
presented deep problems, many of which were philosophical and epistemo- 
logical. Bohr and Heisenberg contributed enormously to this aspect of quan- 
tum theory. With characteristic brilliance, Heisenberg realized that there are 
truly fundamental reasons why the orbits are unobservable: classical physics 
puts severe restrictions on the precision with which we can observe both 
the position and the momentum of an electron. This is because the process 
of measurement of the position, which typically involves a collision with a 
photon, introduces uncontrollable changes in the momentum of the electron. 
Heisenberg’s analysis involved his famous thought experiment using a gamma 
ray microscope. He then elevated this argument into a far-reaching princi- 
ple: we cannot separate the observer from the phenomena that are being 
observed so that measurements of atomic systems introduce uncontrollable 
disturbances to them. His famous uncertainty principle was a quantification 
of this qualitative assumption. Bohr’s contribution was to emphasize the no- 
tion of complementarity, most clearly illustrated by the wave-particle duality 
of all matter. The difficulties of constructing a consistent theory of quan- 
tum measurement were analyzed brilliantly by von Neumann whose work 
revealed the thermodynamic nature of quantum measurement. 

The state of our description of the world of elementary particles and their 
interactions is another illustration of what I have said, perhaps even better, 
since it is still unfinished and beset with problems. In the quantum mechanics 
of Heisenberg, Dirac, Schrédinger, Pauli, and others, there was no possibility 
of deriving the Bohr transition rules with (what Wey! calls) the magic formula 


Ea = Lp = Avan, 


nor could one derive the Planck radiation formula which started the quantum 
revolution. The reason for this is easy to understand: these are features that 
result from the interaction of the atom (matter) with the electromagnetic field 
(radiation). To derive these one has therefore to set up a scheme in which 
both the atom and the radiation field are treated quantum mechanically. This 
was at an entirely new level because the radiation field is already an infinite 
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dimensional system classically. Dirac was the first person to ascend to this 
new level, and his treatment obtained not only all these formulae but also a 
resolution of one of the most vexing problems of classical physics, namely the 
reconciliation of the wave and particle aspects of light. In Dirac’s theory, the 
classical radiation field, which was a wave field, acquired particle properties 
on quantization. Attempts to generalize the Dirac theory into a systematic 
theory of quantum fields which combine quantum theory with special relativ- 
ity then encountered new problems, such as divergences. Eventually many 
of these difficulties (but not all) were resolved. Even quantum electrodynam- 
ics, as currently understood, which is regarded as the most accurate physical 
theory ever constructed, is not fully acceptable to some because of the renor- 
malization rules which are just recipes for hiding ugly divergences in the 
mathematical models that are used. People like Dirac never accepted them 
and one cannot entirely give up the feeling that the current status of this and 
other even less accurate theories is similar to the hybrid theory of the Bohr 
atom with its quantum rules. The standard model does summarize what is 
known accurately but its esthetic ungainliness appears to suggest strongly that 
it is provisional, and that new phenomena (perhaps coming out of the new 
collider at CERN) might be strong enough to suggest more radical models 
which are less divergent. 

The string theorists try to solve all the problems by a pure intellectual 
effort, akin to what Dirac did when his equation predicted anti-matter, or 
Einstein did when he invented his theory of gravitation. Only the future can 
tell if string theory will join these two as a decisive way, free of singularities, 
of looking at elementary particles and their fields, which is as beautiful and 
elegant as these two theories. 

Quantum theory inaugurated a striking departure in the way things are 
described. Before the quantum era, mathematical descriptions of physical 
phenomena followed ordinary experience and there was nothing mysterious 
about it. But quantum theory changed this in a dramatic fashion. The fact that 
the phenomena to be described were remote from ordinary experience forced 
one to invent completely new mathematical schemes that were different from 
anything that preceded them. In basic nonrelativistic quantum theory one 
needs Hilbert spaces, self-adjoint operators and their spectral resolutions, 
quantizations, spin structures, and so on. Later on, when quantum mechanics 
was combined with special relativity, the framework was enlarged to admit 
spacetime symmetry groups and their unitary representations. Quantum elec- 
trodynamics brought its own characteristic features: Fock space, creation and 
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annihilation operators, and as a culminating flourish, the concept of a guan- 
tized field and renormalization rules. The singularities that arise in the adap- 
tation of these very deep mathematical theories to understand the physical 
world ultimately drove the physicists to enquire into the very foundations of 
spacetime, and change its micro-structure from that of the familiar manifolds 
to a supercommuting manifold, and even, a noncommuting manifold. 

Some people are uncomfortable with this foray into the deepest parts of 
mathematics to explain Nature. They feel that the beauty of mathematics is 
very seductive and leads the physicist away from his true quest of describing 
Nature. I remember an occasion when we had invited Julian Schwinger to 
speak in our department. His lecture was on the Epistemology of Modern 
Physics. In a long and far-reaching discussion after the talk, Schwinger 
explained to us his view that the phenomena themselves should force us to 
the mathematical schemes that will best explain them. At that time he was 
referring to his work on the quantum measurement algebra in which he had 
shown, quite brilliantly, how the modern way of explaining quantum theory 
via vector spaces and operators is literally forced on us by the analysis of 
the Stern-Gerlach experiments (see the essay on quantum algebra). This 
is however quite opposite to the approach of the string theorists and the 
supersymmetricians. For them the idea is to perfect the mathematics first and 
only after that look tor physical interpretations that tie it to the real world. This 
is the Dirac mode mentioned earlier (see also the essay on super geometry). 

There is also some misunderstanding, mainly on the part of mathemati- 
cians, about the way mathematics is used by the physicists. It is about the 
role of rigor in physical calculations. | happen to think that rigor is not that 
crucial. The essential issue is whether the particular type of mathematics is 
the right language to use. In some sense therefore physics is more concerned 
with the formal rather than the analytical structure of things. more with the 
question whether we are using the right language, rather than in the minute 
details of fitting the language with reality. The situation is quite different 
from what it is in mathematics. Here is what Hermann Wey! says about the 
role of mathematics in physical sciences: Men like Einstein or Niels Bohr 
grope their way in the dark toward their conceptions of general relativity or 
atomic structure by another type of experience and imagination than those of 
the mathematician, although no doubt mathematics is an essential ingredient. 

There is also a philosophical aspect to this quest that has been examined 
by many people such as Weyl, Schrédinger, Schwinger, and others. The 
creations of the mathematician reflect an esthetic that is purely internal and 
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yet, miraculously, these very same constructions are precisely the ones we 
need when we try to understand the physical world. The examples are many 
and well known: 


(a) the theory of ordinary differential equations and their uncanny applica- 
tions to the Newtonian theory of celestial motion and universal gravitation; 
(b) Riemannian geometry and its use in Einstein’s general theory of rela- 
tivity; 

(c) fiber bundles, connections, and their emergence as the basic tools in 
gauge theories like Yang-Mills; 

(d) the use of spin geometry in Dirac’s equation for the electron and in 
unified theories; 


and so on. In several conversations I had with Harish-Chandra he often 
used to refer to this phenomenon as the coincidence of the inner and outer 
realities. With so many examples like these, there has emerged a point of 
view that what is not forbidden must be true and that only beautiful theories 
have a chance of being also true. Weyl, Dirac, and perhaps even Einstein, 
in his later years, were among the foremost believers in this principle. It is 
in the spirit of this philosophy that one should view the discussions in my 
essay on nonarchimedean physics. There it is not a question of immediately 
connecting p-adic mathematics with physical reality but one of building a 
structure that may have some aspects of reality in it. 

In Newtonian mechanics, and even in classical electrodynamics, the phe- 
nomena studied are for the most part close to experience, and there is no 
difficulty in understanding the forces acting on the bodies and setting up the 
dynamical schemes. But in quantum mechanics and quantum field theory 
this is no longer true. As a consequence one is often forced to use symmetry 
as a guide in the choice of the Lagrangian or Hamiltonian to set up the dy- 
namical equations. Consequently underlying all the discussions in this book 
is the massif of representation theory. It is the essential ingredient in all de- 
scriptions of symmetry and is present in all the essays in this book. My own 
understanding of physics and mathematics is built upon this theory whose 
power to influence mathematics and physics is unmatched. As an illustration 
of its power even in purely mathematical theories I discuss in one of the essays 
its application to the theory of differential equations. 
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1.2 A quantum education and evolution 


My first brush with quantum theory came in 1954 when I was an undergraduate 
in my home town, Madras (now called Chennai, with good justification), 
studying in the Presidency College for a master’s degree in Statistics. It 
was during that year that Dirac came to Madras and gave a lecture at the 
University. I had heard of him of course but had absolutely no idea of what 
he had done or why he was revered so much. I went to the talk in a big 
hall which was overflowing. I understood nothing, but one remark of Dirac 
still stands out in my mind: the world is non commutative. My subsequent 
progress in my studies had no direct connection with quantum mechanics. It 
was mainly probability theory at the Indian Statistical Institute in Calcutta. 
After my degree I went to the US, first as a post doctoral fellow to Princeton 
University in 1960. In the Fall of 1960 I went to the University of Washington 
at Seattle due to the kindness of professor Edwin Hewitt who liked some of 
the mathematics I did in my thesis. Seattle was a wonderful place for me. 
I made the acquaintances of a number of young people—Ken Ross, Albert 
Frodeberg, Roger Richardson, Bill Woolf, Albert Nijenhuis, to mention a 
few. Harish-Chandra’s suggestion that I make myself thoroughly familiar 
with the Chevalley volumes was always ringing in my years, but they were 
difficult to penetrate, especially the first volume. By the greatest of good 
fortune, Richardson, who came from the University of Michigan, had his own 
handwritten notes of Hans Samelson’s courses on differentiable manifolds 
and Riemannian geometry. They were wonderful for a beginner like me and I 
worked through these and finally began to understand what the global theory 
of Lie groups was all about. At the same time I also carefully read Dynkin’s 
famous paper on the classification of simple Lie groups through what we now 
call Dynkin diagrams, as well as Weyl’s proof of his famous character formula, 
given in his Classical Groups and Group Theory and Quantum Mechanics. 
I cannot speak enough of the courtesy and accessibility of Richardson. He 
was already a mature mathematician, with a regular position. His beautiful 
work with Nijenhuis on deformations of Lie algebras was still to come. He 
would come almost daily to the part of the department where Woolf and I had 
offices and talked about mathematics. Nijenhuis and Woolf were collaborat- 
ing on their extension of the Newlander—Nirenberg work to the C!t® case, 
and Nijenhuis would come almost daily to our little alcove. So I was in daily 
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contact with beautiful mathematics, and these influences were pivotal for my 
career. 

In Seattle two things happened: the first, a minor event, was that I picked 
up a used copy of Dirac’s book, almost new, for one dollar (I still have it); the 
second was that in the summer of 1961, Professor George Mackey came to 
Seattle as a Walker Ames Professor and delivered a series of lectures on unitary 
representations and quantum mechanics. I had by this time become very 
interested in group representations and had studied, in a desultory fashion, 
the famous Murray-von Neumann papers on Rings of operators and some 
parts of Weyl’s Classical Groups. But the connections of representation 
theory with quantum theory were not there in my mind until I heard Mackey 
explain them. For me these lectures were a revelation, a first glimpse of the 
noncommutative world. | fell in love with quantum theory, and from then 
on, quantum theory and representation theory were to be my most beloved 
interests. 

That summer in Seattle with its customary lovely rain-free weather, was a 
most idyllic time in my life. I had no duties except for teaching one summer 
course. Mackey lectured every day for four weeks. The lectures were a 
tour de force, connecting all kinds of things with a unified philosophy that 
was profound and inspiring. I attended the lectures, and talked with him 
about almost all aspects of his work. His philosopy, his perspective in both 
mathematics and physics, and his passion, all made the deepest impression 
on me. It was the single sustained personal learning experience that I have 
ever had in my entire scientific career. 

Inspired by Mackey’s lectures I hastened to read Dirac and von Neumann. 
From my point of view as a probabilist, | was most struck by von Neumann’s 
wonderful conception that in quantum theory an orthocomplemented partially 
ordered set replaced the Boolean algebra of events of classical probability the- 
ory. This structure, whose members are the experimental propositions, was 
called by von Neumann the quantum logic, and, for the standard models in 
physics, was just the projective geometry of the closed subspaces of a complex 
Hilbert space. The concept of an observable as a self-adjoint operator arises 
naturally in this context and is the proper substitute for the classical notion of 
a random variable. But, and this was the essential difference in the quantum 
case, classical calculations involving more than one random variable could 
not be carried out in the quantum world unless the variables are simu/tane- 
ously measurable. It was a beautiful theorem of von Neumann that quantum 
observables in the standard Hilbert space model are simultaneously measur- 
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able if and only if their operators commute with each other, or equivalently, if 
and only if they are all functions of a single observable. It occurred to me to 
ask if this theorem of von Neumann on simultaneous observability could be 
proved in the abstract setting of a more or less general quantum logic. I was 
able to do this in the summer of 1961. Then I went to the Courant Institute, 
where I met Warren Hirsch who was very friendly to me and we became 
close friends. He encouraged me to publish this work and I submitted it to 
the Communications in Pure and Applied Mathematics, where it appeared. 

Warren and I became close friends for we shared many common interests. I 
was very young, far away from home, knew nobody. Warren took me under his 
wing and cheered me up tremendously. I will always remember his infectious 
laugh, sense of humor, and supreme courteousness. I shared an office with 
Hermann Hannish; and Warren, who was collaborating with Hermann, would 
come in every morning and we would chat briefly of many things. I remember 
his telling me that the main qualification of Byron “Whizzer” White to be a 
supreme court justice was his ability as a football player!! I did not know if 
he was joking or serious but I enjoyed it!! 

Of all the people I met during my year at Courant (1962), Warren was the 
most vivid, the most humane, and the most interesting, to me. In retrospect 
I realize that he was extraordinarily generous, to have been so accessible 
to an unknown young mathematician (I was 25) from nowhere. He was a 
probabilist, but eventually branched into mathematical epidemiology, where 
he made fundamental contributions and became a seminal figure. 

I returned to the Indian Statistical Institute at Calcutta in 1962. I have 
spoken elsewhere’ about these years and the efforts of a few of us to build 
something lasting at the Institute. From this group only Varadhan remained 
that year. We ran a seminar where we discussed whatever came to our minds: 
quantum theory, markov processes, representation theory, and so on. Markov 
processes were Varadhan’s great love but I was more interested in represen- 
tation theory, and so we switched to that topic. As a first step(!!) we started 
a project of specializing Harish-Chandra’s work on infinite dimensional rep- 
resentations to complex semisimple Lie algebras [2]. 

For me this was the start of a long period of working on Lie groups and 
their representations, and represented a turning point in my mathematical 
education and growth. The work of Harish-Chandra and the immense effort 
required to understand it, led me into the world of semisimple groups. No 
one who has not entered it can understand its incredible beauty. Here are 
objects which are special but are beautiful to an almost limitless degree, and 
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which have moved so many people to spend extraordinary amounts of time 
investigating them. 

Why are semisimple groups so fundamental and so all-pervasive? This 
question seems to have no satisfactory answer. I remember vividly the 
Williamstown conference on representations of semisimple groups in 1972 
when Harish-Chandra was giving a series of lectures on his work on the 
Plancherel formula for p-adic semisimple groups. The lectures were built 
around what he called the Lefschetz Principle, to the effect that all primes 
must be treated on an equal footing. He had used this principle to guide him 
in his work on p-adic groups by starting from his immense insight on real 
groups (at the infinite prime) and seeing how the theory of real groups led him 
to a substantial part of the harmonic analysis of the p-adic groups. He was of 
course aware that this approach may not give everything in the p-adic case. 
To buttress his own conviction perhaps, but certainly to take the audience 
into his confidence, he recounted a story that Chevalley had told him. The 
time of the story was that of the Genesis, when God and his faithful disciple, 
the Devil, were creating the universe. God told the Devil that he (the Devil) 
had a free hand in creating whatever he wanted, but there were a few things 
that He (God) will take care of Himself; According to Chevalley, semisimple 
groups were among these special things(!). Harish-Chandra, after reciting 
this story, added, that he hoped that the Lefschetz Principle was also among 
these special things! 

My interest in the world of semisimple groups would last a long time, 
during which period the world of physics was on the back burner, so to speak. 
However events changed all this very abruptly. In 1983 Harish-Chandra died 
of a heart attack and that was an event that changed my entire intellectual 
landscape. The loss of someone who was a close personal friend as well as a 
great mentor was too much to bear and to understand. When I recovered from 
it eventually, I started to work on other themes in which I could create my 
own paradigm. The idea that I should revive my original interest in quantum 
theory was a natural one and an opportunity to bring it to the front arose 
in 1988 when I was invited to go to Genoa, Italy, to work with the physics 
group there, led by Enrico Beltrametti, and his younger associates Gianni 
Cassinelli, Piero Truini, and their students, especially Ernesto De Vito and 
Alberto Levrero. We called ourselves the quantum group of Genoa for fun 
and that visit was the first of many since then. We worked on fundamental 
questions of quantum theory, relativistic wave equations, extension of the 
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Mackey theory to orbit schemes, and so on. The interest in physics remained 
unabated after that. 

I had, by then, also started talking about physics with Don Babbitt and 
among the topics we wanted to understand was the spectral theory of first or- 
der Schrédinger systems. Gradually we realized that there was no systematic 
treatment of the foundations of first order systems, and that there was a group 
theoretic approach that appeared tantalizingly new. This was the beginning 
of my interest in the group theoretic view of differential equations and the 
starting point of a very long collaboration with Don Babbitt, my longest and 
most sustained with anyone. It led to many long papers [3] on differential 
equations with irregular singularities, and to very happy and deep interac- 
tions with the great masters of the theory such as Yasutaka Sibuya, Bernard 
Malgrange, Tosihasu Kimura, Jean-Pierre Ramis, Werner Balser, and above 
all, Pierre Deligne. Their interest and comments [4] were responsible for 
some of our best work and deepest insights, although our work touched only 
a small part of the beautiful theory of ordinary differential equations in the 
complex domain. The local reduction theory and moduli of irregular systems 
touches in a surprising manner on semisimple groups and their orbit spaces, 
not only over C but also over function fields. 

In UCLA I had the privilege of becoming friends with a great master, 
Bob Finkelstein, and learning, not only physics but also the attitudes and ap- 
proaches of physicists to physical problems. The situation became even better 
for me in the 1990s when Sergio Ferrara, one of the world’s leading authorities 
on super symmetry, accepted a permanent part-time position at UCLA and 
started to come regularly to Los Angeles. We started discussing physics and 
mathematics and that was how I began to get interested in Picard-Fuchs equa- 
tions, regular singular differential equations, their moduli and monodromy, 
and most importantly, supersymmetry. Attending Sergio’s lectures gave me 
the idea to investigate the mathematical underpinnings of unitary represen- 
tations of super Lie groups, especially super Poincaré groups. This I did 
with the Genoa group of Gianni Cassinelli, Alessandro Toigo, and Claudio 
Carmel. I must mention that the work of Kostant [5] was a great pioneer- 
ing effort in super geometry. To me however, personally it was the paper 
of Deligne and Morgan [6] that was truly inspirational and started my own 
excursions into super geometry. 

The p-adic story represents an entirely different direction. Of course rep- 
resentations of the p-adic semisimple groups have long been of interest to 
mathematicians. Already in the early 1960s, Gel’fand, Mautner, Bruhat. 
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Harish-Chandra, Jacquet, Langlands, and others saw that unitary representa- 
tions of p-adic semisimple groups had deep arithmetic significance. But no 
one has thought much of the relevance of nonarchimedean models to physics 
except Beltrametti and his collaboratiors in the early 1970s. Their view was 
that the singularities that were the plague of quantum field theory could have 
arisen from the fact that the micro-structure of spacetime was radically dif- 
ferent from what is usually assumed, for instance, that it is a real manifold. 
They had the idea that one should investigate the possibility that it is a p-adic 
manifold. This idea was revived in a big way in the late 1980s when Igor 
Volovich [7] proposed that in regions of spacetime whose sizes are of the 
order of the Planck units, no measurements are possible and so spacetime 
geometry in such regions could not satisfy the archimedean axiom. From 
the Volovich point of view it is a reasonable assumption that the micro ge- 
ometry of spacetime is non archimedean. I came across thse ideas in 1990s 
when I was already aware of the work of Beltrametti and his collaborators, of 
Weyl’s work on the commutation rules over finite rings. and Manin’s paper 
on adelic physics. My interest received a big stimulus when I went to Dubna 
and Moscow and had the opportunity of spending time with Igor Volovich 
and Anatoly Kochubei. The group-theoretic connections were (and are) es- 
pecially fascinating to me. Although the groups involved are not semisimple, 
the physically interesting cases of the Poincaré groups and the associated 
quantum field theories over p-adic fields and adelic rings present interesting 
challenges [8]. 

All of these topics are discussed in these essays. In this sense the essays 
must be read like a travelogue, an account of an intellectual journey. I mean 
this in the simplest and most naive sense: an account of some of the things 
that | came across which interested me, and which I hope will interest some 
others. The musical analogy with what I am going to say is that it is like a 
string quartet or quintet, a recital of some voices, which I, as a moderator, try 
to blend and present. 

It should be clear to anyone who has read what J have to say in this prologue 
that I have a great indebtedness to a huge collection of friends. But above all 
the greatest debt I owe is to George Mackey and Harish-Chandra. In the last 
essay I write about their work and how my personal interactions with them 
allowed me a glimpse into what may be called, without any controversy, the 


right way of looking at things. 
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Chapter 2 
Quantum Algebra* 


The ways of gods are mysterious, inscrutable and beyond 
the comprehension of ordinary mortals. 
Julian Schwinger 


Quantum algebra was created by Dirac. Its evolution also bears the imprint of the genius 
of many great mathematicians and physicists such as Weyl, von Neumann, Schwinger, 
Moyal, Flato, and others. It has inspired developments in deformation theory, represen- 


tation theory, quantum groups, and many other mathematical themes. 


2.1. The quantum algebra of Dirac 

2.2 The von Neumann perspective 

2.3 The measurement algebra of Schwinger 

2.4 Weyl—Moyal algebra and the Moyal bracket 
2.5. Quantum algebras over phase space 

2.6 Moshe Flato remembered 


2.1 The quantum algebra of Dirac 


The quotation above is Julian Schwinger’s tribute to Wey! on the occasion of 
Weyl’s birth centenary“, but it applies with even greater force to the mysterious 
way in which Dirac and Heisenberg slew all the dragons of classical physics 
and let quantum theory emerge. We can understand almost all of their thought 
processes but there will always be a residue of mystery to the moment of 
creative genius when things suddenly go to a new level of perception and 
imagination, and everything falls into its place as if by magic. 

The term quantum algebra appeared for the first time ina paper of Dirac[La, 
1b] which has now become famous. Just a few months earlier Heisenberg [2] 


* This essay and the next are based on lectures given at Howard University, Washington 
D.C., sponsored by my friend D. Sundararaman, in the 1990s. 
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had come up with the startling and revolutionary idea that in quantum theory 
physical observables must be represented by hermitian matrices which are in 
general of infinite order; and that if the physical observable x is represented 
by the matrix X, then the observable x” is represented by the matrix X Ln 
1,2,...), the n" power of X. However Heisenberg had realized that the 
matrices do not obey the commutative rule of multiplication that the classical 
observables did, and felt that this was a serious flaw in his scheme. Heisenberg 
had sent the proof sheets of his article to Fowler at Cambridge; and Fowler, 
who was at that time the thesis advisor to Dirac, passed them on to Dirac. After 
a study of Heisenberg’s paper Dirac realized that the noncommutativity of the 
quantum observables, which had appeared to Heisenberg as an unwelcome 
aspect of the new mechanics, was in fact one of its central features, and led 
to a structure for the new mechanics which was a beautiful and far-reaching 
generalization of classical mechanics, and which had the classical mechanics 
as its limiting case in the correspondence limit when i — 0. Dirac’s great 
conceptual insight was that the quantum observables belong to an algebra 
which is noncommutative but in which the commutator 


(1) XY — yx =o ay 


of two elements x. y, which measures the departure from their commutativity, 
corresponds to 
th{x, y} 


ee Ss (= dy Ox ~) 
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where 


is the classical Poisson Bracket: 
(la) [x, y] = ih{x, y} 


Implicit here is the assumption that the observables in both the classical 
and quantum theories are denoted by the same symbols but have different 
multiplicative structures. It is not clear from Dirac’s discussion whether the 
commutator is exactly equal to the Poisson Bracket, although for the position 
and momentum observables 
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Dirac postulated the exact commutation rules 
(2) [Pri gs] =—ihd,, (1 <r,s <k) 


Dirac observed that when h — 0, the right side goes to 0, so that the quantum 
algebra becomes commutative in the correspondence limit, and coincides 
with the classical algebra of physical observables. It will turn out eventually 
that the general commutator is not equal to the Poisson Bracket but coincides 
with it up to terms of order h?. 

It is to be noted that the hermitian nature of the elements of the quantum 
algebra that represent the quantum observables is only implicit in Dirac’s 
formulation. But the requirement (1a) which shows that the commutator of 
two quantum observables is not an observable but i/i times one, is clearly 
compatible only with the hermitian condition. 

The commutation rules (2), were also independently arrived at by Born, 
Jordan, and Heisenberg, and Weyl [3, 4, 5]. They give only the kinematical 
structure of the quantum algebra. To determine its dynamical structure Dirac 
proceeded as follows. The time derivatives (at a fixed time for instance) of 


the observables are maps 
dx 


dt 
of the quantum algebra into itself with the following properties: item (i) 
(ax + by)’ =ax'+by’ (a,beC) 


cy Se 


(ii) (xy)! = xy’ + x’y 


(iii) x’ is hermitian whenever x is hermitian i.e., they are derivations pre- 
serving the hermitian property. Dirac verified that in a matrix algebra every 
derivation preserving the hermitian property is of the form 


(3) xt> ila, x] 


for some hermitian element a (this requires finite dimensionality of the algebra 
and in that context it is actually a classical result going back to Skolem and 
Noether). So Dirac was led to the postulate that the dynamics of the quantum 
system is governed by an observable H such that 

,__ ax 


(4) eg a | 
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for all observables x. He naturally identified H with the observable repre- 
senting the classical Hamiltonian and postulated that it is the same function 
of the quantum positions and momenta as the classical Hamiltonian 1s of the 
classical position and momentum coordinates. This prescription is of course 
ambiguous as Dirac himself knew (see §5 of [1a]); the lack of commutativity 
of g, and p, makes it unclear what should be the elements of the quantum 
algebra that correspond to various polynomial expressions in these variables. 
For the usual Hamiltonians of the form 


» Cp? + V(q\,.--. 9k) (c, > O constants ) 


l<r<k 


there is no ambiguity. Incidentally, this point goes back already to Heisen- 
berg’s first paper (see [1b] p. 266). 

The above discussion makes it very clear that the transition from classi- 
cal mechanics to quantum mechanics consists in replacing the classical real 
commutative algebra of real physical quantities by a complex algebra with 
involution * (hereafter called the guantum observable algebra or the quantum 
algebra) whose real(=self adjoint) elements(x = x*) are identified with the 
physical quantities in the quantum theory. In his preface to the first German 
edition of his book [6] Hermann Wey] wrote as follows: 

... itcanjustly be maintained that the essence of the new Heisenberg—Schrédinger- 

Dirac quantum mechanics is to be found in the fact that there is associated with 

each physical system a set of quantities, constituting a (complex) noncommutative 


algebra in the technical mathemaitical sense, the (real) elements of which are the 
physical quantities themselves (parentheses added). 


It is to be noticed that in this initial formulation of Dirac there is no mention 
of states, Hilbert space, and so on. These would come Jater and would become 
part of the definitive framework for quantum mechanics that is due to von 
Neumann, in which states are identified with the unit vectors (up to a phase) 
of a complex Hilbert space, observables are the self adjoint operators on this 
Hilbert space, and the expectation value of an observable A in a state w is 
(Ay. w). If we restrict ourselves to bounded observables, then they are the 
self adjoint elements of the complex algebra of all bounded operators of the 
Hilbert space, thus furnishing what we might call the conventional model for 
the quantum algebra of Dirac. In illustrating the theory it is better to assume 
that the Hilbert space is even finite dimensional, as I shall do very often. 

The Dirac—Heisenberg insight raised the following absolutely fundamental 
questions. 
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1. How can one make more understandable the transition from a commutative 
real algebra to a complex algebra with involution? Note that only the 
commutator in the quantum algebra has a physical meaning (of sorts); the 
multiplication is left hanging, so to speak. 

2. How to construct quantum algebras? 

3. The quantum algebra of Dirac’s theory contains a quantity A. For cal- 
culations it is identified with Planck’s constant but in order to compare 
quantum results with classical results it is treated as a parameter, and the 
correspondence principle of Bohr is formulated as the requirement that 
when fi — 0 the quantum algebra goes over to the classical algebra in the 
limit. Is there a way to formulate this idea mathematically? 

4. To what extent is the quantum algebra uniquely determined? 


In what follows we shall explore these questions and try to come up with 
answers that are currently accepted almost universally. 


2.2 The von Neumann perspective 


When quantum mechanics was discovered, the break with classical mechanics 
was so complete, and involved such strange and unexpected relationships 
between what were thought of as well understood notions like states and 
observables, that it was extremely difficult to accept it on any other ground 
except that it worked. If we accept the complex algebra with involution 
and identify its real elements with the quantum observables, then by general 
results from functional analysis the algebra can be realized as an operator 
algebra in a Hilbert space on which the quantum observables act as bounded 
self adjoint operators, so that one is very close to the usual model. But how 
to bridge the gap from the classical to the quantum algebra? 

It was von Neumann who made the truly profound investigations that finally 
removed much of the mystery behind this extraordinary leap from classical to 
quantum mechanics. This is not the place to go into von Neumann’s ideas in 
any detail; they are discussed with his inimitable clarity and succinctness in 
his wonderful book [7] which is, in my opinion, one of the great landmarks of 
twentieth century science. They have been discussed in many places [8, 9, 10]. 
Let me therefore confine myself to a brief outline. 

The starting point of von Neumann is that associated to any physical system 
there is a structure, the set £ of all experimentally verifiable propositions. 
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This set is partially ordered, namely, that a < b if a implies b. There is also 
an involution a +» at which associates to a its negation. There are also 
some additional relations between these two concepts that flow directly out 
of the physical meaning of these concepts; J shall not consider these here 
because they are technical. The partially ordered set £ was called by von 
Neumann the logic of the system,. He observed that if the system considered 
is classical then L is a boolean algebra obeying the rules of classical logic. 
In von Neumann’s approach the fundamental fact that distinguishes quantum 
systems from the classical ones is that the logic £ of a quantum system 1s 
not a boolean algebra. For technical reasons we shall work with Boolean o- 
algebras, which are Boolean algebras where sums of countably many elements 
exist in the algebra. As long as observations are being made on a single 
observable or a set of simultaneously measurable observables, one operates 
within a single Boolean o-algebra B Cc L. However a quantum logic L 
contains many different Boolean o-algebras which cannot all be put inside a 
single Boolean o-algebra. Examples of this are the observables that refer to 
the wave and particle aspects of light or electrons, more generally observables 
which are complementary like the position and the linear momenta. One may 
say that quantum logics are composed of interlocking Boolean o-algebras, 
and the manner in which the Boolean o-algebras combine to generate L 
contains the very essence of the complementary properties of the quantum 
system. 

Clearly, in trying to understand the transition to quantum logics it will be 
important, even essential, to try to get a view of all possible logics and select 
those that are promising enough to serve as models for quantum phenomena. 
To do this satisfactorily one has to make additional assumptions that may or 
may not have good physical interpretation. Of course the additional assump- 
tions have to be such that they will encompass many models that are useful in 
quantum theory. One of the simplest classes of logics is obtained if one is pre- 
pared to admit that the logic is isomorphic to a classical projective geometry, 
namely, the partially ordered set of linear subspaces of a finite dimensional 
vector space over some division ring. The fundamental theorem of classical 
projective geometry is the statement that under very general circumstances a 
finite dimensional logic is of this form. The vector space necessarily comes 
equipped with a definite sesquilinear scalar product such that _L in the logic 
corresponds to orthocomplementation in the geometry (this is a theorem of 
Birkhoff and von Neumann, see [8, 12]. If the division ring is the field C of 
complex numbers, we obtain (after some technical arguments) the conven- 
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tional model. To get infinite dimensional models one has to drop the finiteness 
condition and use infinite dimensional analogues of the classical theorem of 
projective geometry. There are many results and this is not the place to go 
into them; see however the discussions in [8, 9]. I would like to mention 
the following: my result on geometries of infinite dimension but with atoms 
which can be identified with the geometry of finite dimensional subspaces of 
an infinite dimensional vector space over a division ring; the theorem of von 
Neumann that a modular complemented lattice of rank > 4 is isomorphic to 
the lattice of principal left ideals of a matrix ring over a regular ring (whose 
definite involutions correspond to the orthocomplementations of the lattice); 
as well as the theorem of Piron-Amemiya—Araki on the characterization of 
the standard Hilbertian logic [8, 9]. 

These results clearly show that the usual models of quantum theory where 
this vector space is a complex Hilbert space with its scalar product are not 
unreasonable at all. The road to treat the conventional model in all its details 
is now wide open; the full story is well known but technically nontrivial. In 
the context of Hilbert space it requires the Gleason theorem to identify the 
(pure) states of the system with the points of the projective space of the Hilbert 
space, so that the superposition principle of states that is emphasized by Dirac 
to be the corner stone of quantum theory is nothing other than the fact that the 
states correspond bijectively to the points of the projective space of the Hilbert 
space, and that a state o is in superposition with two others o;(i = |, 2) if and 
only if the corresponding point p, is on the line in the projective space that 
joins p,, and p,,. The linear structure of the symmetries in quantum theory is 
a direct consequence of the classical theorem in projective geometry that all 
automorphisms of a projective geometry are induced by linear or semilinear 
transformations of the underlying vector space. This contains the theorem of 
Wigner that quantum symmetries are induced by unitary or antiunitary maps 
of the Hilbert space (see [13] where this is proved by elementary arguments 
without appealing to the fundamental theorems of classical projective geom- 
etry). An observable x is identified with the map E +—~ x(E) where x(£) 
is the experimental proposition that the value of x is in the set E, thus, a 
o-homomorphism from the o-algebra of borel subsets of the real line into 
the logic. In the Hilbert space context, one can use the spectral theorems of 
Hilbert and von Neumann to identify these in turn with self adjoint operators 
on the Hilbert space. In particular, the bounded observables are the self ad- 
joint elements of the complex algebra of all bounded operators of the Hilbert 
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space whose involution is the adjoint operation, thus finally coming to the 
conventional model of the quantum algebra of Dirac—Heisenberg. 

In very general mathematical terms, there are three objects: the vector 
space over C with a Hilbertian structure, the orthogonal projections in this 
vector space, and the algebra of operators on this vector space. Classical 
geometry shows that these structures are all equivalent: from the algebra 
one can derive the projections as generators of left ideals of the algebra. I 
personally believe that this is one of the the most direct routes to the mystery 
of the quantum algebra. 

The simplest model of a quantum algebra is the algebra A of linear oper- 
ators of a N-dimensional complex Hilbert space 1 with the adjoint * as the 
involution. The quantum observables are then the self adjoint operators of 
H, and the values of the observables are the eigenvalues of these self adjoint 
operators. Every observable has at most N values, and generically, has ex- 
actly N distinct values. We can thus think of this algebra as the quantization 
of the classical algebra of functions on a set having N elements. If A is an 
observable, then by the spectral theorem, if a; are its distinct eigenvalues and 
P, are the orthogonal projections on the eigenspace corresponding to the a;, 
the observable has the representation 


A=) aP, 


The case N = 2 is especially important as it is the simplest. It arises when 
one is interested in studying spin or polarization experiments. The spin refers 
to the magnetic moment and historically goes back to the first experiments 
performed by Stern and Gerlach; we shall come to this aspect a little later when 
we discuss Schwinger’s extremely original approach to the mathematical 
structure of the observable algebra of quantum kinematics. 

In summary, the von Neumann analysis shows that under very general 
mathematical conditions there are essentially only two models for the set of 
observables of finite physical systems: the classical one of real functions on a 
set of N elements, or the quantum one of self adjoint elements of the complex 
algebra with involution of all operators on an N-dimensional Hilbert space, 
with the adjoint as the involution. It must be noted however that neither the 
complex (i.e., not self adjoint) elements nor their product in this complex 
algebra have any physical interpretation. 
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In my opinion it was only when Schwinger re-examined the foundations of 
the theory of measurement in the 1960s that a convincing physical interpre- 
tation of the full complex quantum algebra and its involution was discovered. 
It turns out that the complex algebra is actually the algebra of quantum mea- 
surements. Indeed, one starts by introducing measurement symbols for the 
acts of measurements. The product of the symbols corresponds to successive 
measurements, and the involution to the performance of the measurements in 
the reverse order. The symbols have certain natural relations among them- 
selves, and so one can introduce the measurement algebra as the algebra 
generated by the measurement symbols with these relations. Under suitable 
restrictions this algebra is in fact the quantum algebra. 

Finally, let me remark that there is the question that has always aroused 
great interest in the foundations of quantum theory, namely, to understand 
the role of the complex numbers. Since all physical quantities are real why 
should one work over the complex numbers? There is no definitive answer to 
this question, except that in almost all parts of the theory it is impossible not to 
work over C. Things like matter—antimatter duality, charge conjugation and 
so on, cannot be discussed without the availability of complex conjugation. 
Also, even if one starts over the real numbers one can always complexify, and 
so real systems can always be thought of as complex systems that commute 
with a conjugation operator. There has been nevertheless some interest in 
pursuing this matter. See for instance [18], pp. 194-198, and the references 
therein. 


2.3 The measurement algebra of Schwinger 


The point of view of Schwinger is most beautifully expressed in his own 
words [14] (pp. 1-2): 


The classical theory of measurement is implicitly based upon the concept of an 
interaction between the system of interest and the measuring apparatus that can 
be made arbitrarily small, or at least precisely compensated, so that one can speak 
meaningfully of an idealized experiment that disturbs no property of the system. 
The classical representation of physical quantities by numbers 1s the identification 
of all properties with the results of such nondisturbing measurements. It is char- 
acteristic of atomic phenomena, however, that the interaction between system and 
instrument cannot be indefinitely weakened. Nor can the disturbance produced by 
the interaction be compensated precisely since it is only statistically predictable. 
Accordingly, a measurement on one property can produce unavoidable changes 
in the value previously assigned to another property, and it is without meaning to 
ascribe numerical values to all the attributes of a microscopic system. The mathe- 
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matical language that is appropriate to the atomic domain is found in the symbolic 

transcription of the laws of microscopic measurement. 

The starting point of Schwinger’s analysis was the famous Stern—Gerlach 
(SG) experiment on the magnetic moment of atoms. Let us recall] that the 
experiment consisted in heating small pellets of silver to a very high tempera- 
ture in aclosed oven so that the pellets vaporize. Then the vapor is drawn out 
through an aperture in the oven and made into a thin beam by passing through 
a number of narrow slits. The high temperature of the vapor guarantees that 
the magnetic moments of the silver atoms in the vapor are completely ran- 
domly distributed. The atoms are then forced to pass through a magnetic field 
and then stopped by a screen. One would expect the atoms to be deflected 
differentially and in a continuous manner depending on the values of their 
magnetic moments, and so it would be natural to expect to get a uniform 
spread of deposits. Instead, what one finds are two dark spots, sharply local- 
ized, and nothing more. One is forced to the interpretation that the apparatus 
measures whether the magnetic moment ts either “up” or “down” (these be- 
ing defined as the direction of the magnetic field or its opposite), and that the 
disturbance caused by the apparatus has forced the atoms to split into two 
ensembles whose magnetic moments are either up or down. 

Schwinger’s idea was to build a theory of quantum kinematics from a gen- 
eralization of this example. For the SG experiments the observed results are 
either +1 or —1, referring to the magnetic moment being either in the direction 
of the magnetic field or opposite to it. The directions of the magnetic fields 
are of course completely arbitrary, and one can create a measurement scheme 
where the atomic ensemble has all its atoms with their magnetic moments 
aligned one way and then pass through other measurement apparata whose 
magnetic fields are at arbitrary angles to this direction of magnetic moment: 
furthermore, one can repeat such measurements many times in succession. 
The statistical behavior of the atoms then lead to observable effects which 
completely describe the kinematics of this system. 

To build the structure of the kinematics of general (finite) quantum systems 
Schwinger considers'*:!5-!® a quantum system analogous to the spin system 
of SG experiments where a generic observable has N distinct values (for 
the actual SG case N = 2). Following Schwinger we consider a class of 
generalized Stern—Gerlach experiments in which one makes measurements 
on the system. Typically, a measurement scheme receives ensembles and 
measures the values of an observable A and then sorts them out according to 
the values of A. Various types of measurements can now be considered. If A is 
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an observable with distinct values a), ao... ., ay, then one can conceive of an 
apparatus that allows only ensembles with the A-value a; to enter and permits 
only those ensembles to emerge which have the A-value a;; let us tentatively 
assign the symbol M(a;,a;) for this measurement scheme, a special case 
of which is the symbol M(a;.a;) for the measurement that permits only 
those with A-value a; to enter and allows these to emerge. More generally, 
let A, B be two observables, in general incompatible, with values (a;), (b;) 
respectively; then M(a;. b;) is the tentative symbol for a measurement scheme 
that permits only the ensembles with the B-value b; to enter and allows only 
those with the A-value a; toemerge. When such measurements are carried out 
in succession, say M,. M>,.... M,, the composite measurement is denoted 
by M,M,—,...M2Mj,, the order of measurement being from right to left. 
The measurement symbols therefore form a monoid with a multiplication, 
and admits a natural involution * defined by 


M(a,b)' = M(b,a), — (M,M,_1...M2M\)" = M|M)...M 


t 


To construct the full quantum algebra one has to proceed to specify the rule for 
multiplying these measurement symbols. Schwinger does this and discovers 
that the measurement algebra, namely the algebra generated by the measure- 
ment symbols, is canonically isomorphic to the algebra of all operators in H 
in such a manner that the observables can be identified with the self adjoint 
elements and the involution with the adjoint. 

I have said that the symbols M(a, b) are only tentative. Before defining 
them formally let me follow Schwinger and discuss them a little more. Let 
us first consider the case when there is only one observable and consequently 
only the symbols 

M (a;, a;) Us =o) 


From the physical interpretation it is clear that we should have 
M(a;.aj;)M (ay. an) = 8(J, k)M (aj. Qn). M(a;. aj)’ =a ;.0;) 
The relations 
M (aj, a;)” = M(a;, aj), M(a;,a;) = M(aj, a)" 


which are special cases of the above relations are essentially a formulation 
of the obvious fact that if we pass an ensemble through the measurement ap- 
paratus symbolized by M(a;. a;) twice we get the same result as by doing it 
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just once. Analogous interpretations lead to the more general rules involving 
all the M(a;,a;). The symbols | and 0 represent measurement schemes that 
permit all ensembles to go through without any selection and schemes that 
permit no ensembles to enter respectively. A measurement scheme that per- 


mits ensembles only if their A-values lie in a subset E of the set {a),..., an} 
is denoted by 

ye M (q;, a;) 

icE 


If we take E to be the entire set of values of A we should get 


>| M(a;, aj) = 1. 


Let us now consider measurements involving two observables. The se- 


quence 
M (a;, a;)M(b;, bj)M (aj, a;) 


is particularly interesting. It is just the a;-measurement except for the interpo- 
sition of the b;-measurement. From considerations of actual SG experiments 
one expects only a positive fraction of the ensembles to emerge from this 
scheme; if p(a;, b;) is this fraction, Schwinger postulates that 


M (a;,a;)M(b;, b;)M(a;, a;) = p(a;, b;)M (qj, a;) 


and interprets p(a;, b;) as the probability of finding an individual member of 
the ensemble in the state a; when we know before the measurement that it 
was in state b;. The matrix (p(a;, b;)) is stochastic, namely, 


p(ai,bj)>0, > pla, bj) = 1. 


It is at this stage that Schwinger takes a bold step. He does not explain 
this in [14] or [15] but refers to it as a “leap of imagination” in [16]. He 
thinks of M(a;,a;) as a 2-step process: first the entering aj-ensemble is 
annihilated and then the emerging a;-ensemble is created in its place. So he 
writes M(a;.a;) as a product of two new symbols (a;| and |a;): 


M (a;, a;) = |a;)(a;| 


where (aj| refers to the annihilation of the state where A is a; and |a;) refers 
to the creation of the state where A is a;. The act of measuring A then leads 
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to the plausible interpretations of the new symbols as follows: 
(ajlaj) = (i, jf), Ala;) = a;|a;) (aj|A = a; (aj| 


It is clear now that |a;) and (a;| have to be interpreted as vectors, and indeed 
as dual sets of vectors. Actually, to prepare the reader for this leap, he writes 
|a, b| in place of M(a, b) and the leap consists in splitting the symbol as the 
product |) (|. The observable A then gets identified as 


A= )Vaila;)(ai 
and we have, for any polynomial function f(A), the same formula 


F(A) => flai)la;) (ai 


Proceeding formally, the definition of multiplication of the symbols when 
two incompatible observables A and B are involved takes the form 


M (qj, 4;)M (by, bm) = \aj) {aj |1be) (Bm) = (ajlbe) lai) Om! 
An actual calculation with the multiplication rule defined above shows that 
P(qj, bj) = (aj|bj) (bj \a(). 


Since probabilities are to be nonnegative, it is natural to ensure this by requir- 
ing that 

(a; |bj) = (bj lai)". 
Notice now that p(a, b) becomes symmetric in a and b, and hence that the 
matrix (p(a;, b;)) becomes doubly stochastic, i.e., 


p(a;, b;) = p(bj, ai), Y= plai, bj) = Y= pai, bj) = 1. 
i J 


From this point on the treatment is essentially formal and Schwinger reaches 
his results mentioned above. 

One should of course study algebras with symbols such as the M(a, b). 
We shall do this presently. But before going in this direction it may be of 
interest to exhibit the measurement symbols in the conventional model. The 
measurements symbols will be parametrized by the unit vectors rather than 
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the states themselves. The most general measurement symbol is 
M(w, 0) 


where w, 6 are unit vectors in #. It represents the measurement where only 
ensembles in the state defined by 6 are admitted and only those in the state 
defined by wy emerge. For brevity we write [9] for the state defined by the 
unit vector 9. The adjoint is given by 


(1) M(w,6)' = M(6, ). 


To take care of the redundancy of phase factors in defining the states we 
require 


(2) May, 0) = AM (W, 6) (A| = |v] = 1). 
The multiplication is given by 
(3) M(W,@)M(p,t) = (6, p)M(y, Tt). 


Finally if (y;) is an orthonormal frame, 
(4) > MMi. Wi) = 1. 


The Schwinger measurement algebra is the complex algebra M/ generated by 
the symbols M (yw, 6) with the relations (2),(3) (4). It is clear that this algebra 
has an involutive conjugate-linear antiautomorphism satisfying (1). We write 
this involution as 7. 

What ts the structure of this algebra? We first construct a homomorphism 
of M into the algebra L(H) of all linear operators of H. Denote by 


Lye = (W, 9 EH, |IWI] = [16l] = 1) 


the linear operator 
Lya@e i UH > (0, vv). 
We must remember here our physicists’ convention regarding the scalar prod- 


uct (-). Then the homomorphism is given by 


M(w.0) +> Ly gu. 
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To verify that this defines a homomorphism of M into L(H) one must check 
that the Lg, satisfy the relations (1)—-(4) where we write Lygu in place of 
M(w, @); this however is trivial. Thus we have a well defined homomorphism 
of M into L(H) that takes + into the adjoint. It is obvious that this is surjective. 
The theorem of Schwinger is that it is injective also so that it is a canonical 
isomorphism. 

To prove the injectivity it is enough to prove that dim(M) < N? since by 
the surjectivity we know that the dimension has to be > N?. For this purpose 


we fix an orthonormal frame (y;) in the Hilbert space H. To the frame (w;) 
we associate the N* symbols 


M (Wi, Wj) (<i,j7 SN). 


We now have, for any unit vectors 6, t in H, 


M(6,t) =) MMi, iM, t) )) MH), Wj) 
i j 
= Ve. WD. OM. Ws) 
ij 
which shows that all the (6, rT) are in the linear span of the M(w;, yj). On 


the other hand the rule for multiplication for the symbols, namely, 


MW, wp)M(vx, Wm) = 6), k)M(wWi, Wm) 


shows that this linear span is a subalgebra of M. Thus 
M= Y CM, Wj) 
ij 


showing that 
dim(M) < N°. 


This finishes the proof. We have thus obtained 


Theorem (Schwinger). The measurement algebra is naturally isomorphic 
to L(H) via the map 
MW, 0) +> Lyee 


In this isomorphism + goes over to the adjoint. 


One should note that the canonical isomorphism 
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M(y, A) — Lye 


is essentially the “leap of imagination” of Schwinger that identifies M(y, @) 
with y @4@ interpreted as an element of L(H), namely Ly gu. One should also 
note that the measurement algebra is spanned by the N? symbols associated to 
any orthonormal frame. We have thus completed our program of giving a full 
physical interpretation of the complex quantum algebra and its involution. 

The above discussion does not go very much beyond what Schwinger did. 
Accardi carried out a much more penetrating analysis in [17]. I shall now 
give a brief description of some of Accardi’s ideas and results. 


The work of Accardi 


The starting point for Accardi is a not necessarily finite dimensional +-algebra 
A over the reals R with center K, and certain additional axioms. If A is such 
an algebra, a projection in A is an element X such that X = X* = X?. 
A partition of unity is a family (A,) of projections such that A, 4 O and 
Se A, = 1; sucha family is maximal if the linear span of the A, (which is 
an abelian subalgebra) ts maximal in A, namely, if B €¢ A. BA, = A,B for 
all a implies that B = }°¢, A, for suitable c,, € K. A is a Schwinger algebra 
if there is an integer 7 and a set T of maximal partitions of unity, 
(A(x)) = (Au(2)) 2 (x éT) 


a<n 


such that A 1s generated by the A, (x)(1 <a <n.x € T) over K. The integer 
n occurring in the definitions could be oo. A is a Heisenberg algebra if, first 
of all, it is a Schwinger algebra with |T7| = 2, and, writing the generating set 
as (A,,), (By), the A, By, is a basis for A over K. One can then show from the 
axioms that in a Schwinger algebra A we have 


Ag (xX) Bp(y)AaQ) = pan(X, y)Aa(x) (C20, ba coe). 


where pup(x, y) € K, pap(x, y) > 0 with 
Pab(X, Y) = Phaly, X), > Pab(x, y) = 3 Pab(X, y) = 1 
a D 


so that the matrix P(x.) = (pay(x, ¥)) is doubly stochastic. It is called the 
transition matrix between A(x) and B(\). The additional axioms I mentioned 
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earlier are needed to guarantee the positivity of the p,»(x, y). Here positive 
elements of A are those which are sums of elements of the form A* A. 

Accardi’s main concerns are the following: (a) given a doubly stochastic 
matrix with entries in K, when does there exist a Heisenberg algebra with 
this transition matrix? (b) given a family of transition matrices P(x, y)y yer; 
when does there exist a Schwinger algebra with these transition matrices? (c) 
How do the conventional Hilbert space models fit into this context? For (a) 
he finds, under suitable assumptions on K and the genericity condition that 
Par > 0 for all a, b, that the necessary and sufficient condition is that there is 
a unitary K-valued matrix U = (u.,) such that pup = u* Uap = \u»{> for all 
a, b. For (b) his condition (always assuming that the transition matrices are 
generic in the above sense) is that one should be able to choose unitary U (x, y) 
generating the P(x, vy) such that for all triples x, y,z € 7, we have (what 
he calls the Schrédinger equations) U(x, z) = U(x, y)U(), z). The Hilbert 
space models are then constructed assuming that K = C, from these unitary 
matrices. The beautiful aspect of Accardi’s theory is that the probability 
amplitude u,, appears in a natural way, and transition probabilities are the 
absolute squares |t»|° of these amplitudes, a philosophy that is the corner 
stone of physics from Dirac to Schwinger to Feynman. 

We shall now outline another approach to obtain at least some of the results 
of Accardi. We shall work from the very beginning with complex algebras 
equipped with an involution * (*-algebras). The involution is conjugate linear 
and reverses the order of multiplication. Let then A be a finite dimensional 
+-algebra of dimension N. We denote by ¢ (resp. r) the left (resp. night) 
regular representation of A; it is faithful. Let 


poo ir L(x). 


Then 
bx y) 1 (yx) 


and 
wat) = 1") 


defines a hermitian form on A. Let A be a Heisenberg algebra with respect 
to two maximal partitions of unity, (A,,). (Bp) generating A over C. Then, 
following Accardi, we have the structure constants vy of the algebra A, 
defined by 
ee cd 4 B 
Bp Aa = De cPd 


cd 


a2 2 Quantum Algebra 


where a, b, c, d etc go from 1 ton. From this formula we get 


AG, Ap — Aes 
d 


so that, since )>, By = 1, we have 
2 PabAa Ba = 3 vit Aa Ba 
d d 


giving 
Vie = Pab (1 < d < n). 


We shall now compute explicitly (x, x) = t(x*x) for 
x= DoaaiAe By. 
ab 


Then 


x*x= Y > XapXed By AaAcBa = ) Ftica Beas By. 
abcd abd 


Hence, as +(B,A, Bg) = t(Ag Ba Bp) = Spat (BpAq), we have 


t(x"x) =) XapXadt (B,AgBz) = Yo [eal #UBpAZ). 
abd ab 


To calculate t(B,A,) we note that €(B,A,) kills A.B, if c 4 a, and 


€(ByAq)(AaBa) = BpAuBy =) yi AuByBa = )~ vit! Au Ba: 


uv 


Hence 
iA ye 
d 


Thus 
FG x) Sl! > peelicenles x= y XapAa By. 


ab ab 


So the Schwinger-Accardi condition 


Pap = 0 


is equivalent to the invariant condition that (x, .) is nonnegative definite. The 
genericity condition that 
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Pab > O 


is equivalent to the invariant condition that (x, x) is positive definite.. 

We thus define a premeasurement algebra as a finite dimensional *-al gebra 
whose natural hermitian form is nonnegative. A *-representation of A is a 
representation p of A ina finite dimensional Hilbert space such that p(x*) = 
p(x)* for all x € A. A measurement algebra A is a premeasurement algebra 
whose natural hermitian form is strictly positive definite. Our aim now is to 
show that measurement algebras are nothing more than direct sums of full 
matrix algebras. The results follow simply from the classical theory of finite 
dimensional algebras (Wedderburn) and so we will be brief. 

Suppose that A is a measurement algebra. From r(xy*) = t(y*x) we get 
(x, y) = (y*, x*), and from t(xyz) = t(yzx) we get 


(ax, y) = (x,a"y), (xa, y) = (x, ya"). 


In other words, if we view A as a Hilbert space with scalar product (x, y), 
both the left and the right regular representations are *-representations. If M 
is an ideal (resp. left, right, two-sided), so in M+. Moreover, the action of A 
on M (resp. left, right, or two-sided) is a *-representation. So we can write 
A as a direct sum of minimal two-sided ideals M;. 

Let M be a two-sided ideal of A. Then M* is also a two-sided ideal and 
A = M @®M_°. Let P be the orthogonal projection from A to M and let 
P| =e € M. Since P commutes with left and nght multiplications, we 
have P = £(e) = r(e); moreover, as P = P* = P? we see that e is in the 
center of A and e = e* = e’. If now x € M, we have x = ex so that, taking 
adjoints, x* = x*e, showing that x* € M. Hence M is a *-ideal, 1.e., it is 
closed under x. In particular we may regard M as a nondegenerate *-algebra 
with e as its unit element. 

Consider the orthogonal decomposition A = 4; M; into minimal two-sided 
ideals. Let us fix i and let us take a minimal left ideal N C M;. Then € acts 
on N as an irreducible *-representation. Hence, ¢ is a *-homomorphism of A 
onto the x-algebra End(N). Now for x € M;, y € M, xy lies in Mj) O Mp 
so that xy = 0. Hence M;* is contained in the kernel K of € acting on N. 
But K 4A and so, as M; is minimal, K 9 M; = 0, showing that K = M+. 
We may thus regard the left regular representation of M; acting on N as a 
*-isomorphism of M; with End(N). We thus see that a measurement algebra 
is a direct sum of matrix *-algebras. The converse is trivial. If the algebra is 
simple, or if it is central, 1.e., its center is C, then it is a full matrix *-algebra. 
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Since it is obvious that that direct sums of full matrix algebras are Schwinger 
algebras, we have justified our definition of measurement algebras. 

The generic Heisenberg algebras of Accardi are simple measurement 
algebras. If M is a nonzero two-sided ideal and contains the element 
a Pars XapbAa By, where xeg # 0, then ApxBy = XedAcBy € M so that 
A. By € M. Hence A. BygAc = PeaAc € M, showing that A, € M, hence 
B, A.B, € M, hence finally that B, € M. Summing over b we get 1 € M. 
Hence for these +-algebras we have a Hilbert space model for A. 

Suppose A is a full matrix x-algebra. If (A,) is a maximal partition of 
unity, we have an ON basis of M, say (e,) such that A, are the orthogonal 
projection M —> Ce,. If we have a family (Aq(x));xer, we have a family 
of ON bases (e,,(x))ver and it is clear that pap(x. y) = |uap(x, y)|? where 
U, vy) = (uaa(*, y)) is umitary Within ("fealty ren) en 


U(x, x) = 1, U(x, y)U(y, z) = UG, z) Gu 2 <1). 


Conversely, suppose we are given a family (U(x, y))y.yer Of unitary matrices 
satisfying these relations. We select an n-dimensional Hilbert space M and 
an ON basis (e,, (xo) of it for a fixed x9 € T and define 


€a(x) = U(x0, x)" ea(x0). 


As U(x, y) = U(xo, x)7!U (xo, y) we find that U(x. y)e,(v) = eg(x) and 
Pab(X, Y) = |(€q(x). e,(y))|?.. This completes the treatment of Accardi’s 
theory in the finite dimensional nondegenerate case. 

If A is only a premeasurement algebra, (x,x) is only nonnegative. The 
null vectors form a two-sided ideal N and A/N is a measurement algebra. 
The above analysis applies to it. 

It may be of interest to extend this analysis to the infinite dimensional 
context that Accardi considers and compare his treatment to the limits of finite 
dimensional measurement algebras that Schwinger treats in his papers!*. 

Let us now conclude by discussing briefly a class of finite models. 
They are constructed out of finite abelian groups and go back to Weyl and 
Schwinger®'?. Let G be a finite abelian group and let G be its dual group. 
The Hilbert space of the quantum system is H = L?(G), the space of all 
complex functions on G with the scalar product 


; l — 
‘ey = ia] >) fOR® 


xeEG 
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Then there are two sets of orthonormal bases for 1, the basis of delta functions 

1/2 : gees 
(\G['?S,).<c and OO ec Let A and B be two observables with distinct sets 
of values (a,),eg and (b,) yee and corresponding eigenvectors as the two 
bases. Then 


1 
Gil3., eee 
KIGI"75,, OP =~ 


so that 
l 
M(a,)M(b,)M (ax) = yy elas) 


In other words, in a state in which A has a sharp value, when we make a mea- 
surement of B and then see what is the A-value is after the B-measurement, 
we find that all the A-values are equally likely. Thus all information about 
the first A-measurement has been erased by the B-measurement. The re- 
lationship between A and B is symmetrical and it is clear why we should 
regard them as complementary to each other. The simplest case is when 
G=G= Zn = Z/NZ, the additive group of integers mod N; this was the 
case studied by Weyl and Schwinger. It is obvious that we have a nondegen- 
erate Heisenberg algebra here. 
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We now turn to the remaining questions raised earlier, namely, the construc- 
tion of the quantum algebra for quantum systems of finitely many degrees of 
freedom, its uniqueness, and its dependence on fi. The key to this investi- 
gation is the Weyl quantization map. Its construction goes back to his great 
1927 paper, explicated later in his famous book [6]. Wey! was unable to prove 
the uniqueness of the quantum algebra given the relations (2) and this was 
done by von Neumann and Stone independently [22, 23]. I shall refer to their 
works in more detail later on. 


Weyl’s projective representation and its uniqueness 


Before I start explaining the work of Weyl which is a great landmark in the 
subject, it is of interest to quote from Schwinger’s recapitulation of his own 
work on measurement algebras in [16]: 
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_.. Very early in my study of physics, Weyl became one of my gods. I use the word 
“god” rather than, say, “outstanding teacher” for the ways of gods are mysterious, 
inscrutable, and beyond the comprehension of ordinary mortals. . . 


Already, even at the very inception of quantum theory [5], Wey] had made the 
suggestion that the Heisenberg—Dirac commutation rules (2) be replaced by 
the commutation rules between the corresponding unitary operator groups 


(em). (elt) 
Let 
(5) U(a)=e4, Viby=el?? (a, bER*) 
where 
ag =aiqi t+: +a, b-p=b pi +--- + be px 
Then the relations (2) are formally equivalent to 
(6) U(a)V(b) = eV (b)U (a) 


The commutation rules (6) have the advantage of dealing with unitary op- 
erators which are better behaved than the wabounded operators that must 
represent p and q. 
The essential originality of Wey! consisted in considering not just U and 
V by themselves but to treat them both together. To this end Wey] started 
with the map 
(a, b) -> U(a)V(b) 


Although the U (a) and V(b) are unitary representations of R*, the fact that U 
and V do not commute shows that this map is or a unitary representation of 
R*. But it is a projective unitary representation, i.e., a unitary representation 
up to phase factors, or equivalently, a homomorphism of R~ into the projective 
unitary group of the Hilbert space of the system. This is quite appropriate 
since it is the structure of this projective space with its probability functional 
\(v. @)|° that determines the structure of quantum mechanics. 

Let me recall some basic facts about projective (unitary) representations. 
Given a separable locally compact topological group G, a projective repre- 
sentation of G is a map 

L:xt—>L, 
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of G into the unitary operators of a Hilbert space such that the induced map 
into the projective unitary group is a continuous homomorphism. In down- 
to-earth language this is the same as saying that 


L,Ly = A(x, y) Ly (x,y €G) 


where 
Ax, y)ET 


T being the multiplicative group of complex numbers of absolute value 1. 
The continuity assumptions are equivalent to requiring that x +> (L,wW, @) 
is Borel for all yw. 6 in the Hilbert space. The function A : G x G —> T 
is called the mu/tiplier of the projective representation. The map of G into 
the projective unitary group of the Hilbert space is unchanged if we replace 
L. by a(x)L, for any borel map A : G —> T; then d changes to x, y R> 
A(x, ya(x)a(y)/a(xy). 
Following von Neumann’? we consider the normalized map 


(7) Wi Kae) i —> ey (a) V(b) 


which is also a projective representation of R**. This normalization is also 
implicit in Weyl’s 1927 paper (see formula immediately before (52) of that 
paper). A simple calculation shows that 


(8) W(a, b)W(a',b') =m(a,b:a',b')Wiat+a’,b+b’') 


where, 1, the so-called mu/tiplier of the projective unitary representation W, 
is given by 


m(a,b: Te b’) = elit/2yNla -b—a-b') 


Wey! postulated that quantum kinematics is described completely by the pro- 
jective unitary representation W. It is clear from this that the infinitesimal 


generators Q,, P, of the unitary groups 
(Gane), «=V(O,....f...)  @isinther™ place) 


satisfy the commutation rules (2). He also showed that if one takes any 
projective unitary representation of a real vector space V of finite dimension 
d and suppose that it is faithful and irreducible, then its multiplier 7 is 
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necessarily of the form 
m(x : y) = eGA/b@y) (x,y EV) 


where f is a symplectic form on V x V. It then follows that d = 2S 
even and one can choose a linear isomorphism of V with R* x R* such that 
(identifying V with R‘ x R‘) 


Bla, b:a’,b'‘)=a'-b-ab 


This means that one can restrict attention to projective unitary representations 
W with multiplier (9). 

This was Weyl’s formulation of matrix mechanics. A special choice of W 
can now be introduced, the so-called Schrédinger model. Here the Hilbert 
space H is L?(R*) and W = W is defined by 


(10) (Wo(a, b)w)(q) = eI W(qg+hb) (qe R‘) 


Weyl then formulated wave mechanics as the kinematics determined by the 
representation Wo and raised the problem of equivalence of wave and matrix 
mechanics in the following form: is every irreducible projective unitary rep- 
resentation of R“ x R* with multiplier m as in (9) unitarily equivalent to Wo? 
It is of course quite easy to check that Wo is irreducible. 

In this form Weyl! was not able to prove the uniqueness. But he made a 
remarkable observation, namely, that if one replaces the space R‘ x R“ by 
the finite group Z, x Z, where Z, is the additive group Z/kZ of intergers 
mod k, and m by 


2rila’-b—a-h')/ 
m,(a, b -a’,b’) a e2tila b—a-b’)/k 


then the same problem of uniqueness can be formulated in this context. He 
then showed in [6] that the answer to the uniqueness question in the finite 
context is affirmative. He also suggested that in the limit as k — oo it is 
possible to view the Schrédinger model as the limit of these finite quantum 
systems, so that the uniqueness in the general case is very plausible. 

Direct proofs of the uniqueness of Weyl kinematics were constructed by 
Stone [22] and Von Neumann [23] very soon after Weyl’s paper [6] was 
published. They established uniqueness in the following strong form: any 
projective unitary representation of R‘ x R* with multiplier m defined by 
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(9) is unitarily equivalent to the direct sum of a number of copies of the 
Schrodinger model Wo. Subsequently Mackey formulated and proved the 
uniqueness theorem in the context of any separable locally compact abelian 
group, and this theorem had far-reaching consequences [24]. 


Weyl quantization map 


The concept of a quantization map is of course fundamental because the quan- 
tum Hamiltonian is obtained by applying this map to the classical Hamilto- 
nian. Preferring to work with bounded operators, Weyl defined the quantiza- 
tion map Q» ona space of sufficiently nice functions on R‘ x R“. He required 
that 

Qn: eel”? t—> Wa, b) = ef" U (a)V(b) 


and then extended it by linearity. Using Fourier analysis this means that Q»p is 
well defined for all functions on the algebra F which are Fourier transforms 
of integrable functions of the dual variables a, b. More precisely, let f = Fg 
where F is the Fourier transform map: 


f(q, p) = (Fe)(q, p) = ii e! 9+?) o(a, b)dadb. 


Then O,(f) = W(g): 
f=Fe 


(11) On(f) = |e. b)W(a, b)dadb. 


Here da etc refers to the self-dual Lebesgue measure on Ré which is (277)~*/? 
times the standard Lebesgue measure. 
The Weyl quantization map is thus 


(12) On: froWwF'f) (f€F) 


Wey]! actually did not insist that the Fourier integrals be convergent but be 
defined only in some distributional sense. This allowed him to calculate 
formally the quantization of polynomial functions in q. p. He found that 
the quantization of a polynomial is obtained by replacing q,. Pr by the corre- 
sponding quantum mechanical operators Q,. P,, and replacing any monomial 
in the g,. p, by the averages of the corresponding products in all possible 
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orderings (the Weyl ordering)(23): 
On(f) =: fq, P): 
Furthermore, it is easy to see that 
W(a, b)' = W(a, b)~' = W(—a, —b) 
so that 


[ ea b)W (a, b)dadb is self adjoint <> g(a, b) = g°(—a, —b) 


which leads to 
O,(f) is self adjoint <=> f is real 


If we make additional restrictions on f, for instance that f be smooth and 
rapidly decreasing, i.e., that f is in the Schwartz space, then Qp(f) is an 
integral operator of trace class with kernel 


(13) Kn(f)(4.9) = &1 («nrg +q'),q - a) 


where gj is the partial Fourier transform of g = F~' f in the first variable. 
Going back to W let me recall that if A is a locally compact abelian group 
and w is a unitary representation of A, then the map 


gr> wg) := fl w(x)g(x)dx 


is a *-homomorphism of the convolution algebra L'(A) into the algebra of 
bounded operators in the Hilbert space of w with the property that 


w(g*) = w(g)" 2 (x) = gen 


If now w is only a projective unitary representation with multiplier 5, then 
Weyl observed that 


gt— w(g) i= [ ecodx 
is still a *-homomorphism on L'(A)s where L! (A); is the twisted convolution 


algebra associated to 4, i.e., the underlying vector space of L'(A); is L! (A) 
with the twisted convolution defined by 
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(21 *5 82)(x) = [ eoreee — y)d(y, x — y)dy 
(14) 


= fac — y)g2(y)d(x — y, y)dy, 
while the +-property corresponds to 
w(g*)=w(g)' —_g*(x) = g(—x)3(x, —x) 
Let us now specialize this to the case when 
A=R‘ xR 5=m 


and write 
81 *m 82 = 81 *n 82 


which is justifiable since m is determined uniquely by 4. Then 


(15) (2 *p g(a, b) = fe (ay, b')exla _ a. b i b’)eth/2ab'—a"-b) ag! dp! 


Thus 
(16) QO(Fg,)Q(Fg2) = O(Fg), & = 21 *n 22 
or 
O(fi)Q(fr) = A(/f) 
where 


I (qd, P) = / fila’, Bs ila—a'. b—b')eih/2a-b'—a'b)—ila-gt+h P) dq'db'dadb 


with 
g(a, b) = [ea pei 9+’ Mdgdp 


Weyl quantization on an abelian group 


The completely group theoretic nature of his quantization of the classical 
phase space, which is so obvious in the above discussion, led Weyl to a far- 
reaching generalization of the process of quantization. He formulated in his 
book [6] (pp. 275-276) the general principle of quantization as follows. 
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The kinematical structure of a physical system is expressed by an irreducible 
Abelian group of unitary ray rotations in system space. The real elements of 
the algebra of this group are the physical quantities of the system; the representa- 
tion of the abstract group by rotations of system space associates with each such 
quantity a definite hermitian form which “represents” it. 


Here “group of ray rotations” is Weyl’s term for a subgroup of the projective 
unitary group and so is the same as a projective unitary representation. Re- 
placing the classical phase space R~ by an abelian group A we can formulate 
Weyl’s principle of quantization as follows. Let ¥ be an algebra of complex- 
valued functions on a separable locally compact abelian group A. One need 
not be very specific as to how F is to be restricted but as in most of Weyl’s 
examples we take F to a subalgebra of the algebra of functions on A which 
are the Fourier transforms of integrable functions on the dual group A. Then 
quantization is a linear and faithful map 


On: ft OQ) (feF) 


from F into the algebra of bounded operators on a Hilbert space H, the 
Hilbert space of the quantum system. If Fp is the image of F by Qn, he 
viewed Fy as the quantization of F. The map Q» was required to satisfy the 
self adjoint property, namely, 


Of) =Ondty (eF) 


This means that 7; has an involution, namely the adjoint operation *, and that 
the quantum observable that corresponds to a real classical observable f is a 
self adjoint element of Fy. 

Although Wey! treated only the cases of a vector space or a finite group, 
one can clearly work with any separable locally compact abelian group. This 
was done in my paper [25]. Let me now sketch briefly the main results. 

For any separable locally compact abelian group G let Z*(G) be the com- 
mutative group of Borel 2-cocycles with values in the circle group, B*(G) 
for the subgroup of trivial cocycles, and H*(G) = Z°?(G)/B*(G) for the 
quotient group of multipliers. For any m € Z7(G) we can give L'(G) the 
structure of a Banach algebra by defining the mvisted convolution by 


(fi *m f2)(x) = / Ai fal — y)m(y, x — y)dy (ce G) 
A 
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This algebra structure depends only on the cohomology class [7] of m, namely 
on the image of m in H7(G); and is commutative if and only if m is trivial, 
in which case it is isomorphic to the usual convolution algebra structure on 
L'(G). Let C?(G) be the group of bicharacters of G, S°(G) the subgroup 
of symmetric ones and A-(G) the subgroup of alternating ones (m(x, x) = 
1, m(x, y)m(y. x) = 1). One has C?(G) C Z?(G) and if G has no 2-torsion 
(this is equivalent to the dual group G having no 2-torsion) then C?(G) maps 
onto H*(G) with kernel $°(G) while C?(G) = S2(G) ® A2(G) so that under 
the natural map of A?(G) —> Z?(G) we have 


hO(G) SHAG) (G has no 2-torsion). 


Let now A be a separable locally compact abelian group with character 
group A. Let W(A) be the space of all functions on A which are in L'(A) and 
whose Fourier transforms are in L'(A). (Itis possible to work with the larger 
space of functions which are Fourier transforms of elements of L'(A) but 
WA) is preferable as it is mapped isomorphically onto W(A) by the Fourier 
transform, so that itis self dual.) Itis easy to see that W(A) and W(A) are both 
closed under convolution, multiplication, translations by group elements, and 
multiplications by characters; moreover, if h, k are continuous functions on 
A (resp. Ae then h *k is in W(A) (resp. W(A)). Then, for any m € Zz (A) 
W(A) has the structure of an associative algebra given by 


fiafe = 8, f= Fite 


The structure of this algebra depends only on the cohomology class of m, 
and is abelian if and only if m is trivial, in which case it is just ordinary 
multiplication. If A has no 2-torsion, we can take m € A2(A), the *-Operation 
is f re f°" on W(A), and the trivial case corresponds to n= ihe 
generic case is when m is symplectic: this means that (in the general case 
when m is not necessarily a bicharacter) that 


m(x, y) 
m(y, x) 


which is always a bicharacter, induces an isomorphism of A with A (when 
A has no 2-torsion and m is in A2(A), this is the case when m itself induces 
an isomorphism of A with A). In this case it is appropriate to call WW(A) the 
Weyl algebra of A. 
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We have already seen that when A = R~ the symplectic multiplier leads 
to the usual Weyl-Heisenberg representations. Wey] also discussed the finite 
case in [6]. However recent work in physics suggests that it may be of interest 
to consider situations where the real manifolds are replaced by p-adic ones 
which arise in number theory. This leads to what I call the arithmetic Weyl 
algebras. Anexample of these is as follows [25]. Let K be anonarchimedean 
local field of characteristic 4 2, let V be a vector space of finite even dimen- 
sion over K, 6 asymplectic bilinear form on V x V, and let x be a nontrivial 
additive character on K which we use to define the Fourier transform on the 
space S(V) of Schwartz—Bruhat functions on V. Then S(V) is the arithmetic 
Weyl algebra with multiplication defined by 


Coe / FG Pe — nx (Bor x2 aude 


The theory of quantum systems over loca! and global fields is very interesting 
but this is not the place to go into them. 


Weyl—Moyal algebra and the Moyal bracket 


Weyl’s ideas were taken up more than twenty years later in 1949 by Moyal. In 
aremarkable paper [26] Moyal made a deep study of Weyl’s quantization map. 
Moyal’s motivation was to reintroduce the classical phase space in quantum 
mechanics by identifying the quantum algebra with an algebra of functions 
of the “non commuting quantities g,, p;”. He did this by pulling back the 
operator product via the Weyl quantization map to a product operation on the 
space of functions on the classical phase space. The resulting algebra is the 
Weyl-Moyal algebra. This is of course nothing other than the Fourier trans- 
form of Weyl’s twisted convolution and so we call it the Wey/—Moyal product. 
Unlike Weyl who did not investigate the Poisson structure on the classical 
function space when it is quantized, Moyal went further and calculated the 
operator commutator in terms of this product and obtained what we now call 
the Moyal bracket on the classical function space W(R* x R*), 
The Weyl—Moyal product 


f= finh 


is defined by first convolving the Fourier transforms in the twisted manner 
and then applying the inverse Fourier transform: 
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(17) 


i Gap) = | Ala’. b') fala—a’, b—b' eth Merb’ ab) a-g+ PV ta db dadb. 


Then 
(18) 
Qn fien fo) = On( fi) Qn( fr). Cfo fom) = 1On fi) On). 


Since the map f +— Qp(f) is injective, it follows from (17), (18), and the 
associativity of operator multiplication that 


(19) Ue io ho Be fi fo fine — fea fi 


define, respectively, the structures of associative and Lie algebras on the 
classical function space W(R* x R‘). 

To explore the dependence of the algebra defined by (17) on f, one may 
proceed by expanding the exponential 


(20) e(th/2)(a-b'—a"-b) 


as a series in powers of fi. From now on we shall assume that the classical 
function space is S(R*‘ x R‘), the algebra (under both multiplication and 
convolution) of the Schwartz functions in q. p. The terms of the series can 
then be explicitly evaluated. For instance, the constant term is (f) f2)(q. Pp), 
while the next term becomes, after some calculation, 


(21) 5 PU f2)(q, P) 

where P( fi, f2) = (fi, fo} is the Poisson Bracket. Thus 
(22) finfe = fit + FP fe) + OP) 
We define the Moyal Bracket of f, and f> by 

(23) [fis fola = fin to — fanfi- 
Then 


(24) [fi. fala =iht fi, fo} + OCA). 
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This is just the Dirac prescription but with the essential refinement that 
(fi, foln is not exactly equal to ih{ fi, fo} but is only equal to it modulo 
h?. We have thus constructed the quantum algebra associated to a general 
classical system, giving meaning ex posto facto to Dirac’s profound and re- 
markable insight. We shall see presently that for the canonical variables the 
Dirac and Moyal brackets are the same, and the same is true for angular 
momentum observables and certain obvious generalizations of them. 

A similar calculation gives us all the higher order terms as well. To make 
it transparent we introduce two sets of variables 


(2 (2) 
a )p ) 


1) 1 
q' ) p' 
and the associated Fourier transform variables 


q). po). a®, b®, 


We then have the differential operator 
0 a 0 0 
SS ) (sin @ sn - sn os | 
(i) (2) I 2 
ice \8Gj” Bp; Api? Bqj” 


and let [1 be the induced operator on the Fourier transform space via the 
transform. Then IT is multiplication by the polynomial 


(25) fi= > (a)? +a). 


l<jsk 


Finally, let Res be the operator from S ik” sok ) to S(R**) which is the 
restriction to the diagonal: 


Res(f)(q, p) = f(q, p,q, p). 


Then 
P=Resoll(fi ® fa). 


We define the bidifferential operator Py by 
(26) Py : fi, f2t-—> (Reso II”) (f, @ f). 


Then the general term of the expansion of (17) in powers of h is 
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(in/2)% 
27) Pub fis fe) 


We thus have the formal expansion of (17): 


co & 33 
GH/2)-" 3! 
(28) wala 2 ——_——— P) 3 
py QM +1)! om+iC fi, f2) 
Symbolically we write this as 
; 2 h 
(29) firlh=e? LA. fala = a sin =P 


which go back to Moyal’s paper [26]. 

Let us summarize this discussion. Let S be the Schwartz space of the clas- 
sical phase space R**. Then S is acommutative algebra under multiplication. 
For any real value of the parameter /, the definition 


fine = et" 


interpreted by (17) converts S into an associative algebra denoted by S;. Then 
Sis a family of noncommutative associative algebras which have S as their 
limit when i — 0. Moreover, under the commutator product, S, becomes a 
Lie algebra whose Lie bracket becomes the Poisson Bracket on S in the limit 
when hi — 0: 


(30) LA. fale = fin fo —21-nfi =ih{ fi, fo} + OW) 


It is natural to ask if this product structure can be defined for larger algebras 
than S. This can be done at least formally by treating h as a formal parameter. 
Let us write t = ih /2, treating t as a formal parameter and define 


(31) fiiiise” = fit DSPh fi 


r>] 


where the series is treated formally. It then makes sense even when the fi; 
are in C®(R*) because the P, are bidifferential operators. The associative 
law persists in the formal sense because the identities that have to be estab- 
lished for associativity are differential operator identities that are true for all 
differentiable functions once they are true for Schwartz functions. Thus (31) 
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= as , 2k 
defines the structure of an associative algebra in the formal sense on C™(R™ ). 


We shall make this more precise in the next section. 


The quantum algebra as a limit of finite dimensional quantum algebras 


This idea, as we have mentioned earlier, was due to Weyl and taken up by 
Schwinger later [6, 19]. Kinematically this was done in great generality 
in [20]. To understand the limiting process dynamically we consider, for 
simplicity, motion on a line where the potential is big at infinity (it is enough 
that V(qg)/ log |g| — oo as |g| > 0) so that the energy spectrum Is discrete, 
as in the case of the harmonic oscillator. Indeed, taking N to be an odd natural 
number, one can identify Z, with the finite grid X(N) C R: 


ey Sica AY = iOReliviee?..... +N"} 


where 


1/2 
ev = (=) pe 2NO Ae 


The Hilbert space of the system is now L7(X(N)); the position operator 
gn is multiplication by the coordinate; the momentum operator py 1s the 
Fourier transform of gy, the Fourier transform being the one defined for the 
finite group Z(N) = Zev /N-Zen with which X(N) can be identified. The 
Hamiltonian is 


he 
Hy = 5 Py + V(qw). 


In the limit when NV — oo this system goes over to the usual system on the 
real line with the Hamiltonian 


tas 
H = =p? +V(q). 


An invariant way to formulate this limit is to ask that 
Tr(f(Hw)) —> Tr(f(H)) = (N > oo) 


for a large class of functions f for which f(H) is of trace class. See [6, 19, 
20, 21] where this is done in arbitrary dimension and for functions of the form 
e~** of the real variable x, leading to the result that the dynamical operators 
in imaginary time converge in a very strong manner: 


2.5 Quantum algebras over phase space 49 
|| exp(—s Hy) — exp(—sH)||; > 0 (N > oo) 


where ||-||, is the trace norm in the Banach space of trace class operators. 


2.5 Quantum algebras over phase space 


The Weyl—Moyal theory may be summarized by saying that quantum me- 
chanics is essentially equivalent to imposing on the classical commutative 
algebra of tunctions on the phase space a family of non commutative algebra 
structures depending on fi such that (22)-(24) are satisfied, thus giving the 
Dirac insight a very satisfactory interpretation. Depending upon the choice 
of the classical algebra, the quantum parameter fi can be analytic or formal. 
The Hilbert space of quantum mechanics now arises as the space on which 
these non commutative algebras are represented by operators. 

Mathematicians will recognize in the relations (22)-(31) the fact that the 
quantum algebra may be viewed as a deformation of the classical algebra 
with fi as the deformation parameter. If 1 am not mistaken, this point of view 
was pioneered by (the late) Moshe Flato and his collaborators and colleagues, 
F. Bayen, C. Fronsdal, A. Lichnerowicz, D. Sternheimer, and H. Basart, in a 
series of seminal papers [27, 28, 29, 30]. Although deformations of associative 
and Lie algebras had been an established field, the application of this subject 
to the foundations of quantum theory was new and was first done by them. I 
shall now try to describe briefly some of the main results in this development 
which is still very active. This has fitted well with the recent fascination 
among theoretical physicists for non commutative geometry and so there is a 
great following for this type of work. 

The calculations involving the Weyl—Moyal product and the Moyal bracket 
may be viewed as describing a class of deformations of S or C™ (R**), viewed 
as a Poisson algebra. To explain this point of view let us begin with the notion 
of deformation of associative algebras first. We take a naive point of view. 


Analytical deformations 


Let A be an associative algebra over C. Intuitively one speaks of a deforma- 
tion of A when one is given a family of associative algebras A,defined for 
small values of a parameter f whose underlying vector spaces coincide with 
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that of A such that item (i) if we write 
ds gr f+8 


for the product in A,, then the product map is nice in some sense (smooth, 
analytic, etc) 


(ii) For t = 0, f -;g reduces to fg, the product in A. 


t is called the deformation parameter. 


Formal deformations 


Here one views t as an indeterminate and passes to A[[t]] which is the algebra 
of formal power series 


f =ageeaie aoe ee (a; € A) 


Clearly the product 
i. 8 tae 


in A extends to a product 


fe 
k, = 2S fr8q 


p.q20,p+q=r 


in A[[t]] where 


We denote by 
Allt]]o 


the algebra thus obtained. A formal deformation of A is now defined as an 
associative algebra whose underlying space is A[{t]] and whose multiplica- 
tion 


figt> fug (fe € Affe) 


is such that when t = 0 it reduces to the algebra A[[t]]o. If we write 


fig=fet+tci(, +r s+... Geet) 


where 
C;:AxA—A (Co = id) 


2.6 Moshe Flato remembered 5]. 


are bilinear maps, then the associativity requirement means that the C; satisfy 
the identities 


SS Gao. c)) = = C,(C,(a, b), c) 


r>0.s>0,r+s=n r>0.s>O0.r+s=n 


Conversely, if the C; : Ax A —> Aare bilinear maps satisfying these iden- 
tities, the definition of product for two elements of A extends to an associative 
algebra structure on the vector space A[[r]] that coincides with the algebra 
structure A[[t]]o when rt = 0. The reason for introducing formal deforma- 
tions is that they are algebraic objects and so may be studied with more ease 
than analytical deformations. Moreover, any analytical deformation in which 
the product is analytic in the parameter gives rise to a formal deformation. 
However true quantization requires analytical deformations and not formal 
deformations, so that the formal deformations become more remote from the 
physical point of view. In particular, if in any given context we establish the 
existence of a formal deformation, there still remains the problem of making 
analytical sense for these formal deformations for at least a large subalgebra 
of the original algebra, if the deformation is to be given a physical interpreta- 
tion, for instance, to obtain a representation in a Hilbert space. We call such 
deformations quantizations. 

Considerable work has been done in the directions suggested by the above 
remarks. Quantizations of symplectic and Poisson manifolds have been ob- 
tained. In the most important case of R™ as the phase space it was shown 
by Moshe Flato and his collaborators that the Wey!l-Moyal deformation 1s 
essentially the only one, thus exhibiting quantum mechanics as the only pos- 
sible deformation of classical mechanics. In other words the Dirac quantum 
algebra is the only one possible. In my opinion this uniqueness theorem is 
the most fundamental foundational result in making transparent the transition 
from the classical to the quantum algebra. 


2.6 Moshe Flato remembered 


The idea that the Weyl-Moyal algebras should be treated in the broader frame- 
work of deformations of Poisson algebras was pioneered by Moshe Flato and 
his collaborators F. Bayen, C. Fronsdal, A. Lichnerowicz, and D. Sternheimer. 
Soon others such as De Wilde, Le Conte, Fedesov, Deligne, and Kontsevich 
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took up this theme and went very far with it. It is unfortunate that Moshe 
passed away at the peak of his creative powers. I want to pen a few lines in 
which I try to give a glimpse of the mercurial personality of this remarkable 
man. 

We first met in Liege in the mid 1970s in a beautiful conference on rep- 
resentation theory and fundamental physics that he organized. After that we 
always spent time together during his visits to UCLA, and once in Dijon 
when he invited me for a talk in his seminar. He told me once that had we 
met earlier we could have done much work together. When the AMS and the 
International Press published my Selected Papers, I sent him a copy with a 
letter recalling to him this remark. But unfortunately it reached him when he 
was already in a coma. 

He had a very broad view of physics and expressed his preferences openly 
without worrying whether they were politically correct. He was that rare 
person who was very strict with the big people and very gentle with younger 
and less established scientists. To illustrate his kindness to young people the 
following incident would suffice. I had a student, Luca Guerrini, working 
on deformations of Lie algebras under me and after he finished his thesis I 
asked him to write up his results and submit it to the Letters in Mathematical 
Physics, The submission was rejected but I noticed in the rejection letter a tiny 
ray of hope: it was the comment that the paper does not explain why anyone 
would or should be interested in the questions studied in the submission. I 
told Luca that I will help him write an introduction and resubmit without any 
other change. This was done and the paper got accepted without requiring 
any modifications almost immediately. Some months later I was in Paris 
and met Moshe just for a few hours. The first thing he told me was “don’t 
think I did not know that you wrote that intro.” It turned out that this was 
the last time that I would see him for he died a few months later. A few 
months earlier he had agreed to be the mentor for Charlie Conley, another 
student of mine. Moshe suggested new lines of investigations and put Charlie 
in touch with people who were doing such things. This changed Charlie’s 
entire scientific life in a dramatic manner and launched him on an extremely 
successful research career. 

When Moshe passed away, there was going on a work shop on represen- 
tation theory at UCLA, organized by Chris Fronsdal and Joe Wolf. I gave 
a eulogy for Moshe at that workshop, which was later published in one of 
the volumes of the Proceedings of the Conference that was organized in his 
memory (by Daniel Sternheimer and others). I reproduce it here. 
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Moshe Flato (1937-1998)! 


Moshe Flato died on Friday, Nov 27, 1998, in Paris, of complications from 
a massive cerebral haemorrhage he suffered a couple of days earlier. His 
shocking and untimely death is a tremendous loss to the world of mathemat- 
ical physics, and even more so, to his family and large circle of friends and 
admirers. He was buried in Tel-Aviv. 


Moshe’s work covers a large spectrum of high energy physics, straddling 
diverse aspects of quantum field theory and elementary particles. His interest 
was always in fundamental questions. He was constantly searching for a 
conceptual understanding of the strange world of elementary particles, and 
explored many new ideas and ways of thinking about them. He created a 
school in Dijon and Paris and shaped the intellectual developments of a huge 
number of young scientists, some of whom became his closest friends, such 
as Daniel Sternheimer and Jacques Simon. 

I would like to make some very brief comments on his work. In quantum 
mechanics there is a famous uniqueness theorem due to von Neumann and 
Stone, which says that any system of operators satisfying the Heisenberg— 
Wey] commutation rules is isomorphic to the standard system which is called 
the Schrodinger model. The content of this theorem is that there is only 
one way to quantize a classical system with the specified commutation rules, 
namely through canonical quantization. In a famous paper published in the 
mid 1970’s, Moshe, and his collaborators Bayen, Fronsdal, Lichnerowicz, 
and Sternheimer, proved a wonderful variant of this uniqueness in a very 
different context. Their idea was to think of quantization as a deformation 
of the Poisson algebra of classical observables. Canonical quantization fur- 
nishes such a deformation, the Moyal-Wey] deformation. They proved that 
this is essentially the only deformation of the classical Poisson algebra. In 
my opinion this uniqueness theorem lies at a much greater depth than the von 
Neumann-Stone theorem and may be thought of as the definitive clarification 
of the questions that were raised first by Hermann Wey] in 1927 regarding 
the foundations of quantum physics. The ideas coming out of this paper rev- 
olutionized quantization theory, made it possible to think of quantization in 
contexts where canonical quantization does not make sense (curved space- 
time, constrained mechanics, etc), and led inevitably to the theory of quantum 


| Talk delivered on Dec 5, 1998, in the Lie group workshop held at UCLA on Dec 5-6, 
1998. 
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groups and quantum homogeneous spaces. Recent work on g-quantum me- 
chanics by Finkelstein and others would not have been possible without this 
seminal achievement. During the 1980's, Moshe, in collaboration with Frons- 
dal, made a deep analysis of the representation theory of the conformal group. 
Their theory of the so called singletons, which was very much ahead of its 
time, has recently come to the forefront in high energy physics. 

Throughout his life he was deeply interested in formulating and under- 
standing the nonlinear aspects of quantum field theory. He called this the 
“nonlinear program” and wrote many papers on this theme, with Simon and 
others. His recent work in collaboration with Simon and Taflin is a technical 
tour de force on the Cauchy problem and scattering theory for the coupled 
Maxwell and Dirac equations that arise in quantum electrodynamics. 

He loved the subject of mathematical physics deeply and was very unhappy 
that (in those days) there was no forum where important and interesting dis- 
coveries in mathematical physics could be published quickly. The physicists 
had several such journals, but it was often very difficult to get papers pub- 
lished in them because of lack of sympathy of the physics community toward 
major discoveries that were more mathematical and did not lead to immediate 
physical interpretations. Moshe understood this situation and with charac- 
teristic energy and enthusiasm founded, in 1977, the Letters in Mathematical 
Physics precisely to correct this anamoly. The Letters has become one of the 
premier journals in mathematical physics. If you look through its pages you 
will find all the major developments in the last two decades represented there 
when they first came up. There is no question that he was the moving spirit 
behind the Letters. This is one achievement that will live long after him. 


But bigger than all of his achievements in science is the fact that he was a 
decent man. Behind his rough exterior lay a heart of extraordinary generosity. 
He was lavish in giving his time to others. I learned a lot of physics by talking 
to him and reading his papers. He was especially generous to young people, 
understood their difficulties at the start of their scientific careers, and helped 
them in every possible way. He treated his friends royally, and influenced their 
lives in ways that cannot be enumerated. He was one of the few men I have 
known of whom it could be said that he was larger than life (a comment also 
made by one of the young scientists who was deeply influenced by Moshe). 

As one gets older, one finds invariably that one cannot do all the things 
that one did when one was young, and certainly not at the depth that was 
reached in youth. The natural tendency to compensate for this is to reduce 
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one’s ambitions and do what one can to advance science. Moshe scorned 
this way of living. He pushed himself and others around him without any 
slackening and lived on the creative edge all the time, right up to the moment 
of his death. According to the Hindu view of life this is one of the noblest 
ways to live and die. 

When I learned of Moshe’s death I was reminded of Isaac Stern’s words 
on the occasion of David Oistrakh’s death, when he referred to Oistrakh as a 
golden man. Moshe was a golden man. 
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Chapter 3 
Probability in the quantum world* 


When you watch you find that the electron goes either 
through one hole or the other, but if you are not looking 
you cannot say that it either goes one way or the other! 
Such is the logical tightrope on which Nature demands 
that we walk if we wish to describe her. 

Richard Feynman 


Probability has been a part of quantum theory from its very beginnings. The original 
probabilistic interpretation of quantum mechanics was put forward by Max Born. How- 
ever the idea that the understanding of Nature had to be statistical was unacceptable to 
a lot of people including Einstein, and there was substantial criticism of this aspect of 
quantum theory. Eventually, the ideas of von Neumann, Bell, Feynman, Mackey, Glea- 
son, and many others on the probabilistic aspects of quantum theory clarified the situation 
and answered the criticisms. These contributions have made the role of probability in the 


quantum world both far-reaching and profound. 


3.1. The statistical interpretation of quantum theory 

3.2. The uncertainty principle of Heisenberg 

3.3. Hidden variables 

3.4. EPR 

3.5. Transition probabilities in the atom 

3.6. Feynman path integrals and the Feynman—Kac formula 


3.1 The statistical interpretation of quantum theory 


The most controversial part of quantum mechanics, at the time of its creation, 
was its statistical aspect. People were unwilling to accept that Nature allows 
us only to calculate the probability distributions of physical quantities. Of 
course one can measure any quantity (in principle) with no dispersion, but the 
theory insisted that we pay for it by having to forego any knowledge of certain 
other observables. There was also a feeling that the new ideas demanded the 


* This essay and the previous one are based on lectures given at Howard University, 
Washington D.C., sponsored by my friend D. Sundararaman, in the 1990s. 
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abandoning of causality. The questions raised were deep and puzzling, and 
were not fully resolved for almost a decade or more. The roles of Einstein 
and Bohr in these discussions were decisive and are well documented. 

The fundamental statistical interpretation of quantum theory is due to Max 
Born. If y is the wave function of the electron, he postulated that 


i, lv(x, y, z)|°dxdydz 
E 


is the probability of finding the electron in the region E C R?. This is a spe- 
cial case of the following more general statement that was later obtained by 
von Neumann: if A;(1 <i < k) are commuting self adjoint operators repre- 
senting simultaneously measurable observables, and P is their joint spectral 
measure, and if the system is in the state [6] represented by the unit vector 6 
(we often say the state 6), the probability that a measurement on the A; will 
yield a value of (A), .... Ag) belonging to aregion E C R* is 


\|P26||°. 


To see how the Born prescription is a special case of this all we have to do is to 
take the Hilbert space to be L?(R*) and the operator A; to be the one that cor- 
responds to the i coordinate of the location of the electron, namely, A; is the 
multiplication by x;. As another example, if A is a self adjoint operator with 
simple discrete spectrum a, a2, ... and corresponding normalized eigenvec- 
tors Ww}, Wo,..., then, for a system in state [6] if a measurement is made to 
find the value of A, the probability that the value will be a, is |(w,.@)|* for 
(Ae large If H is the Hilbert space associated with a quantum system, 
and if y, 9 are two unit vectors, the quantity 


I(r, A)? 


is the probability that if the state of the system is [4] ([@] is the state defined 
by @) and an experiment is performed to verify that it is [y], then the system 
will be found in the state [y]. In particular, the expectation value jz9(A) and 
and the dispersion 07 (A) of A in the state [9] are 


te(A) = (A6,0), a 2(A) = ((A — re (A)1)’0, 8) 


In principle, ail statistical assertions of quantum theory are derivable from 
these. 
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This prescription furnishes the details of the fundamental departure of 
quantum theory from classical theory, namely, the fact that in quantum theory 
the state determines only the probability distributions of all physical quan- 
tities. There is no abandonment of causality: the state itself moves causally 
as determined by the Schrodinger equation, but the theory denies the possi- 
bility of determining the exact values of all the observables in any state. It is 
very easy to give examples of states, especially in the finite models, in which 
certain observables have sharp values while others do not have this property. 
This was the source of a profound controversy about the consistency and com- 
pleteness of the quantum mechanical view of nature which we shall analyze 
in more detail in a little while. For the moment we shall concentrate on the 
mathematical source of the statistical interpretation. 

One way to understand this picture is through von Neumann’s proposi- 
tional logic of quantum mechanics, namely the logic L of closed subspaces 
of the Hilbert space H.. Here we think of £ as a set with a partial order (impli- 
cation) C, and an orthocomplementation (negation) S +> S+. For example, 
suppose we are looking at one particle moving in R* and the experiment 
is measuring its position at a certain time f, then the quantum proposition 
would be that the particle was found in a region E C R®. The Hilbert space 
is L?(R°*), and the proposition is represented by the subspace of vectors van- 
ishing outside E. If instead we speak of locating the momentum vector of the 
particle in a region F C R’, the corresponding subspace is the subspace of 
states whose Fourier transforms vanish outside F (Here Fourier transforms 
are in the L?-sense). 

In full generality, if S is a closed subspace of representing an experi- 
mental proposition to be verified, then in the state y the probability that the 
experimental proposition is true is 


IPswil? 


where Ps is the orthogonal projection on S. These probabilities completely 
determine the distribution of any observable in the state y; indeed, if A is 
any observable and P% is its spectral resolution, then, in the state y the 
distribution of A is the probability measure 


E+ ||Pewll. 


This shows that the assignment 
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py: St |IPs¥ll 


on the logic £ of the system, the logic of all closed subspaces of H, completely 
determines all the statistical properties of observables in the state y. The py 
are often called vector states. It is easily verified that py has the following 


properties: 


(a)O0 < py(S) <1 forall Sel 

(b) py (0) = 1, py (H) = 1 

(c) If S; are mutually orthogonal closed subspaces of and S is their 
orthogonal direct sum, then 


py(S) = > py (Si) 


These properties generalize those of a probability measure on a Boolean o- 
algebra and so one can think of py as a probability measure on the logic L. 
In fact, we shall call any function p(a@ — p(a)) on an arbitrary logic with 
these properties a probability measure on the logic. If p 1s any probability 
measure on £, then for any observable A with spectral measure P” we may 
define the probability distribution of A with respect to p as the probability 
measure 
E +— p(Se), S; = thewange or ee 


So one can define a more general notion of a state of the system as a probability 
measure on £; the earlier notion is obtained if this measure is of the form py. 
Note that the correspondence 


[wv] =, Pw 


is bijective. Indeed, py depends only on [y], and if py = po then 
0= py(h~) = pow") => 0 € (WW)? => (6) = 1] 


Now the set of probability measures is a convex set closed under infinite 
convex combinations. These represent mixed states inthe conventional theory 
and are appropriate for quantum statistical mechanics. If one considers only 
the convex combinations of the vector states, py, it is an easy Consequence 
that we obtain the measures of the form 


IDR St Trey Ps) = Tr (PE Pep ey, 
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where 7 is a self adjoint nonnegative operator of trace class and trace 1. 
In conventional quantum theory T is called a density matrix or the density 
operator, the measures pr were first discovered by von Neumann, and (in- 
dependently) by Landau. Such 7 have a discrete spectrum and if its nonzero 
eigenvalues are ¢; > 0 with corresponding eigenvectors w; (orthonormal), 


then 
Pr= > nowy, (Cc; seni), > a a 1) 


t 


showing that if the system is in the state 7, it is in one of the states [w;| 
with probability ¢,, so that py is a classical superposition of the states [w;]. 
If the eigenvalues are simple, this decomposition is unique, but when there 
are multiplicities, there 1s nonuniqueness since for a multiple eigenvalue the 
choice of the orthonormal basis in the corresponding eigenspace is not unique. 
However, T is uniquely determined by pr. 

Now von Neumann had to assume much more about states to be able to say 
that they must be of the form p;. The question whether any measure on L is 
of the form p7 was raised by Mackey and was solved by Gleason in a famous 
paper [1], 2] where he showed that if dim(H) # 2, then every probability 
measure on L(H) is of the form py for a unique density operator 7. The 
restriction on the dimension is essential. It can then be shown that the vector 
states p, are the extreme points of the convex set of all states. Traditionally 
they are known as pure states; in the conventional model they are simply 
called the states, and represent the states with maximal information. 

In dimension 2 there are bizarre states not coming from density operators 
T; however there is no point in considering them because as soon as one 
forms the composite of a two-dimensional system with another one, we are 
in the case allowed by Gleason’s theorem, and so we are in the framework 
of states defined by density operators. Thus in all cases it is natural to work 
with the states pr. 

It is only after the Mackey—Gleason analysis that the conventional picture 
is complete, with the points of the projective space P(H) of H representing 
the (pure) states of the system. The function 


x :[w]. {0} > Iv, a)? 


is well defined on P(H) x P(H) and represents the probability structure 
on the set of states. A symmetry of the system is then a bijection of P(H) 
that preserves 7. The fundamental theorem of Wigner is that symmetries are 
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induced by unitary or antiunitary operators of which moreover are unique 
up to a multiplicative phase factor (complex constant of absolute value 1). 
If the symmetry group is connected, the symmetries are unitary. For many 
connected groups the ambiguities involved in the choice of the phase factors 
in the unitaries implementing the symmetries can be resolved, so that we can 
say that as a rule, symmetry is governed by unitary representations of the 
symmetry group or some central extension of it by a torus. Thus taking the 
group to be the group of time translations, time evolution is governed by a 
unitary representation of R, written 


tre U, = cant 
where H, aself adjoint operator whose existence and uniqueness is guaranteed 
by Stone’s famous theorem on unitary one-parameter groups, is identified 
with the energy or the Hamiltonian of the system. How do the states evolve 
in time? Under time evolution given by the unitary group above the pure 
states move by . 
th> y= ey, 


where wo 1s the state at time t = 0; infinitesimally, differentiating with respect 
to t, this becomes 


The conventional physical interpretation of the statistics of the observables 
under time evolution is now immediate. One remark: the dynamical groups 
defined by H and H + a where a is a real constant, induce the same actions 
on the projective space of #. Thus the Hamiltonian is ambiguous up to an 
additive constant. 


3.2 The uncertainty principle of Heisenberg 


From the very beginning quantum mechanics faced serious problems of phys- 
ical interpretation. The transition from phase space dynamics to the dynamics 
of non commutative algebras of observables and their probability distributions 
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raised difficult questions about physical interpretation. It was Heisenberg who 
took the crucial step of exploring the physical consequences of the commuta- 
tion rules that dominated the early formulations of the theory. By a brilliant 
analysis based on thought experiments he was led to his famous uncertainty 
principle which has become the corner stone of all physical interpretations of 
quantum theory. 

The uncertainty principle of Heisenberg illustrates the basic point that in 
quantum mechanics it is impossible that all observables have sharply defined 
values in any given state. For any observable A let jz, and o; denote its mean 
and dispersion in the state p,,. If oy is small, then most of the probability 
distribution is concentrated in a small neighborhood of jz; in such a case 
we shall say that the observable is highly localized in the state yw. Let us 
now consider the quantum system of a particle moving in R whose position 
and momentum operators are defined as multiplication by x and —ihd/dx 
on H = L?(R). If w is a state to which these operators can be applied in 
succession (for instance if y is a Schwartz function), then, writing A? and 
we for the respective dispersions of position and momentum, one can show 
that 

h 
A, A, = 3 
which shows that any attempt to define the position sharply implies a very high 
dispersion in the value of the momentum and vice versa. This is the famous 
uncertainty principle of Heisenberg. This of course is mathematically easy 
to prove, and one can even give a complete description of the states where 
the product of dispersions reaches its minimum value, the so-called coherent 
states. However Heisenberg’s real accomplishment was not the mathematical 
part but the physical part, wherein, using remarkable thought experiments 
that involve trying to locate an electron using a gamma ray microscope, he 
showed (see [3, 4]) that the laws of classical electrodynamics and the wave— 
particle duality relations already place such severe restrictions on the errors 
of observations on the position and momentum that one must have, up to 
second order inh, 
ges. colts 


The wave-particle duality of light which figures prominently in Heisen- 
berg’s discussion was emphasized by Bohr as a corner stone of quantum 
theory. Bohr would eventually call it the principle of complementarity of 
quantum systems. Its basis is the fact, established by many experiments, 
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that all particles have wave properties. The photoelectric effect reveals really 
particle properties of light, while the phenomenon of electron diffraction can 
be explained only under the assumption that electrons have wave properties. 
The particle and wave aspects constitute the so-called wave—particle duality 
of all matter, a basic fact in atomic physics, that was first put forward by de 
Broglie. However the wave and particle aspects of matter are complemen- 
tary; no single experiment can reveal both properties. This was how Bohr 
formulated his principle of complementarity. From our mathematical point of 
view, the non commutative nature of the logic reflects this complementarity. 

Let me briefly discuss the mathematical part of the Heisenberg uncertainty 
principle. We have two self adjoint operators A, B satisfying the commutation 
rule 

(Ay si 7a! (ae R,a #0) 


We wish to show that if y is in the domains of both AB and BA, 


Oy(A)oy(B) = a 


We replace A, B respectively by 
A’=A-—py(A)l. Be B 1 0BN, 
a change that does not affect the commutation rule. Then, from 


(A'B iy — B'AW, wv) = ia 


(A'y, By) — (By, Ap) = 23(A'Y, B'y) = —ia 
Then, by Schwartz’s inequality, writing |a| = ea, 
(*) 


la| x , , ! , , 
a = TENA. BY) < (AW. BW) <A WIINIB WI] = oy (A)oy(B) 


The same argument proves a more general result due to H. P. Robertson: if 
Ac Bl =16 


then 
I 
Oy (A)oy(B) => 5l(Cy, w)I 
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As an illustration of this more general inequality Robertson considers the 
angular momenta 


M,, My, M, 


around the three axes which satisfy the commutation rules 
[M,, My] =iAM,, etc 


showing that the product of the standard deviations of two components of the 
angular momentum vector in any state is always greater than or equal to h /2 
times the absolute value of the mean of the third component in that state. 

In the finite models of Weyl and Schwinger which we have talked about 
earlier, there are complementary observables x, y with the following property: 
if one of them has a sharp value, all the values of the other are equally probable. 


3.3 Hidden variables 


Classical statistical mechanics is also a statistical theory in which the states 
are probability distributions on phase space which evolve causally according 
to the equations of motion. From the beginning a major controversy had 
raged about the true nature of the statistical aspects of quantum theory. Many 
people felt that the reason why quantum theory has to be statistical is that 
one cannot know the exact state of the atom, so that the situation is very 
similar to statistical mechanics where averagings over the phase space are 
used because we cannot know the precise mechanical state of a system whose 
number of degrees of freedom is really huge (~ 107%). The argument was 
then made that the situation is analogous in quantum mechanics and that the 
statistical properties of quantum systems are the result of averaging over some 
unknown or hidden variables, as they came to be called. This argument, thus 
pursued, then led to the conclusion that quantum mechanics was an incomplete 
description of physical reality. Whether this is actually true or not is the so- 
called hidden variables question. 

The first to deal with this question in depth was von Neumann. He reached 
the remarkable conclusion that the results of quantum theory cannot be dupli- 
cated by any hidden variables mechanism. Because of the fundamental nature 
of the controversy and its importance, von Neumann gave both a mathemat- 
ical and physical argument; they were not complementary but rather they 
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reinforced each other. For additional details see [2, 6] and my survey [5c] of 
Mackey’s work. 

The physical side of von Neumann’s argument is as follows. If a hidden 
variables explanation is to be allowed, then we should be able to resolve any 
ensemble into finer and finer ones where more and more observables have 
sharply defined values. But this method of successive resolution is doomed 
from the start because of the fact that in atomic systems, any measurement 
changes the system, and hence changes the values of quantities which have 
been previously measured. Thus, in the resolution of atomic ensembles we do 
not get ahead by such a procedure and there seems no way of reaching those 
ensembles where every observable has a sharply defined value, ensembles 
that are called dispersion free. The analysis of Stern—Gerlach experiments 1s 
a vivid illustration of this argument. 

Nevertheless, one may still cling to the hope that there may be other ways of 
reaching the dispersion-free ensembles. To examine this further it becomes 
necessary to enquire more into the structure of the dispersion-free ensem- 
bles and see whether their existence is in conflict with quantum mechanical 
experience. This leads us to the mathematical side of the argument. 

The mathematical argument that I shall present now, which is based on the 
Mackey-Gleason analysis and is a significant sharpening of von Neumann’s, 
is that in the conventional model there are no states that are dispersion free 
except when we are in the trivial case where dim(H) = |. For obtaining 
this result in as general a context as possible von Neumann replaced the 
ensembles by the functionals characteristic of the ensemble, which associate 
to each bounded observable its expectation value in the state defined by the 
ensemble. So von Neumann viewed an ensemble as the functional 


E:By —R 


where Bp is the space of all bounded self adjoint linear operators on the 
Hilbert space H, such that, for any A € Bo, €(A) is the expectation value of 
the observable represented by A in the state (=ensemble) identified with &. 
It is natural to assume that this functional had the following properties: 


(a)E(1) =1 

(b) E(A7) > 0 forall A € Bo 

(c)E(aA+bB) = a€(A)+ bE(B) if A. B are commuting elements of Bo 
and ape R. 
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But von Neumann assumed much more; for instance (c) was required even 
when A and B do not commute, a much more tenuous hypothesis than (c) 
obviously. He also assumed, for technical reasons, continuity properties for 
€. We shall not assume these extra properties. We shall call as stares the 
functionals satisfying only the conditions (a)-(c). This definition of a state is 
the widest possible compatible with von Neumann’s idea of identifying the 
ensemble with the state. 

If € is a state and A is an observable, then the dispersion of A in that state 
is defined as the number 

E(A’) — (E(A)Y? 


which is always > 0. The state is called dispersion-free if this is 0 forall A. 
In such a state, any observable A has a sharply defined value, namely €(A). 
Let £ be the logic of a physical system. For any a € CL let Q, be the 
observable which is | if the experimental proposition a is verified to be true, 
and 0 if it is verified to be not true. The Q, are the so-called questions of 
Mackey [5]. Then in any state there is a probability p(a) that a is true, 


Pr (Q.) = p(a). 


Clearly 
0=pla@j=!1, pO)=90, pi)=! 


and since Q,, takes only the values 0 and 1 we have 
Expo ,) — pita), Dispersion (Q,,) = p(a)(1 — p(a)) 
So a state determines a function 
p:£— (0, 1], at p(a) 


This is just the state functional but it is now restricted only to the questions. 
Ifa; € L(i = 1,2) and a, < ay then one can form the sum a = a, + a) in 
£, the two propositions are simultaneously verifiable, i.e., define commuting 
projections, and one should require that 


pia — pai) + pa). 


In other words, the function p is a finitely additive probability measure on L 
and every expectation functional gives rise to a finitely additive probability 
measure p on L£. If the state is dispersion-free, then p(a)(1 — p(a)) = 0 
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so that p(a) = 0 or | for any a. Let us call a finitely additive probability 
measure on £ which takes only the values 0 and 1 a two-valued measure. To 
prove that there are no dispersion-free states it is thus enough to prove that 
there are no two-valued measures on L. 


Theorem. (a) /f L is a Boolean algebra, there are two-valued measures on 
Land every finitely additive probability measure on L is a natural mixture 
of these. (b) If L admits no two-valued measures, then L has no imbedding 
into a Boolean algebra. (c) If £L contains a sublogic isomorphic to L(D*) 
where D = R,C, then L admits no two-valued measures. In particular 
L(H) has no two-valued measures and so has no dispersion-free states if 
3 —dim (1), = eo; 


Proof. (a) is just the classical theorem of Stone. In fact, £ (is isomorphic 
to) is a Boolean algebra of subsets of some space X; if x € X, the Dirac 
measure 6,, which is one if A € £ contains x and 0 otherwise, 1s a two- 
valued measure. Clearly every measure is a mixture of these. (This argument 
reverses the historical development. Indeed, Stone proved the isomorphism 
of an abstract Boolean algebra with a Boolean algebra of sets in some space 
by first proving that there are two-valued measures, and identifying these as 
the points of his space!) 

(b) follows trivially from (a). 

To prove (c) it is enough to prove that there are no two-valued measures on 
L(D"). But if p is atwo-valued measure, it is now countably additive and so 
Gleason’s theorem applies and shows that p comes from a density operator 
T. If w is a unit vector, then (Tw, yw) = p(D-w) is either 0 or 1, and so by 
continuity, is = 0 or = 1, showing that T = 0 or T = 1. This is impossible. 
If dim(H) > 3, then we can find a subspace S of dimension 3, and the map 


SS SOs SCS) 
is an imbedding of £(S) as a sublogic of L(H). This finishes the proof. a 


The restriction on dimension is essential. It is possible to show that for 
the conventional model in dimension 2 the statistics can be obtained through 
the hidden variable mechanism (see [6]). This is somewhat remarkable and 
paradoxical because it was the Stern—Gerlach experiments involving spin that 


played a prominent role in understanding and motivating the conventional 
model! 
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The structures of measures on a Boolean algebra and a non commutative 
logic are thus essentially different. For instance, in the conventional model 
for quantum mechanics, not only are there no two-valued measures but every 
probability measure on the logic takes all the values between 0 and 1. The 
complementarity structure is too rich to allow gaps in the values. Moreover, 
in almost all models, the logic is a lattice, and for any measure p one has 


p(a\) = 1, play) = 1 => p(a,Na) = 1 


which is very surprising since the assumptions on p involve only commuting 
projections. The point is that the very fact that the measure be defined for all 
elements of the logic forces the coherence properties such as the ones above. 

There are variations of the above argument where the emphasis is on for- 
mulating the hidden variables question as that of imbedding the quantum 
propositional logic inside a Boolean algebra. See 


¢ N. Zierler, and M. Schlessinger, Boolean imbeddings of orthomodular sets 
and quantum logic, Duke Math. Jour., 32(1965), 251-262. 

¢ S.Kochen, and E. P. Specker, The problem of hidden variables in quantum 
mechanics, J. Math. Mech., 17(1967), 331-348. 


Imbeddings of the logic inside a Boolean algebra will give rise to two- 
valued measures on the logic which can be forbidden under suitable condi- 
tions. Thus, ultimately, it is the absence of two-valued measures on logics 
that is the reason for the impossibility of a hidden variables explanation of 
quantum mechanics. 

This brief discussion does not do full justice to the remarkable contributions 
of many people to this circle of ideas, to the structure of quantum logics, 
to probability on logics, and the philosophical implications of these ideas. 
Some of these can be found in the references below. See also the articles in 
the Wheeler—Zurek [7b] and Lahti—Mittelstaedt [10c] volumes. 


© J.M. Jauch, Foundations of Quantum Mechanics, Addison-Wesley, 1968. 
© C. Piron, Foundations of Quantum Physics, Benjamin, 1976. 

° M. Jammer, The Philosophy of Quantum Mechanics, Wiley, 1974. 

C. A. Hooker (Ed.), The Logico-Algebraic Approach to Quantum Mechan- 
ics, Vol. 1., Reidele 1975. 

E. G. Beltrametti, and B. C. van Frassen, Current issues in Quantiun Logic, 
Plenum, 1981. 
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Bell’s inequalities 


In the 1960’s J. S. Bell, dissatisfied with the von Neumann argumentation, 
challenged the assumptions underlying it [7a]. It is difficult to accept Bell’s 
objections in the general framework treated by von Neumann, Mackey and 
Gleason that we are considering. Nevertheless Bell’s work was important 
because, by considering certain restricted situations, he was able to bring the 
hidden variables question into the domain of experimental testing. 

Bell considers a system that consists of a pair of spin 1 /2 observables which 
have interacted in the past and are now sufficiently apart to be distinguished 
as particles 1 and 2. Such systems had been considered earlier in connection 
with the so-called EPR paradox (see below). The spin orientations of the two 
particles can then be measured by Stern—Gerlach magnetic filters and their 
correlations computed and verified. One can also work with photons and 
their linear polarization states and measure these by polarization analyzers— 
the mathematics of computing the correlations between the two particles is 
essentially the same in both cases. Bell now wanted to see if these quantum 
correlations could be explained on the basis of a hidden variables model 
which was local in the sense that measurements can be made on each particle 
without apparent disturbance of the other. He was then led to what are now 
known as Bell’s inequalities which had to be satisfied by the correlations if 
the statistics of the experiment that Bell proposed were to be explained by 
a local hidden variable mechanism. This discussion allowed him to propose 
experimental tests whether his inequalities were indeed violated in practice. 
Many experiments have since been conducted and they have revealed that 
Bell’s inequalities are indeed violated, and that the observed correlations are 
exactly as predicted by quantum mechanics [8, 9]. I shall now discuss briefly 
Bell’s ideas. 

We begin with the two-dimensional Hilbert space K = C? and the usual 
model for quantum mechanics associated with it; the calculations below are 
standard. Let © be the set of obsevables which only take the values +1; these 
are parametrized by hermitian matrices X (a) corresponding to unit vectors a 
given by 


ay ag + ia, 
az-1ia3° —a, 


x(a) = ( ) a = (a), a2, a3), ||al| = 1. 


We choose normalized eigenvectors s,(a) of X (a) for the eigenvalues +1. 
If Pi(a) are the orthogonal projections on the one-dimensional spans of 
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s+(a), then 


X (a) = P,(a) — P_(a), I = P,(a) + P_(a) 
from which we get 
I a ae, G 
Fea) = eee) 


We write u” for the vector whose components are complex conjugates of 
those of the vector u € K. If u,v are two unit vectors and P,. P, are the 
orthogonal projections on the one-dimensional spans of u, v, then 


in@eas |, 9)|?. 


Hence, as Tr(X (a) X (b)) = 2 a-b, we get, 


> 1 . . 1 
I(sz(a). s2(b))|> = sn(u a2 X(a)) (7 a x(b)) = (1 + (4)(t)a8)). 
We shall now consider the state of the form 


1 
T= —(tt ©t.+t* @t_) 
2 a 


where {t,.t_} is an ON basis of K. It is a simple calculation to verify the 
following crucial property of T: 


t is independent of the choice of the ON basis {t,, t_}. 


This means that for any ON basis {u, v} for K we can write 
: ( *@ut+v ® v) 
T= —[u ®u 
2 


Write 
X\(a) = X(a) @ 7, KGa) = 1S) X (a): 


Then X,(a) and X2(b) commute for all a, b and we can calculate their indi- 
vidual and joint distributions in the state t. We use the above expression for 
t when u = sz(a)*, Vv = sx(a)* and v = si(b), Vv = s~(b). First we have 


1 
P,({X)(a) = £1) = P,(X2(b) = £1) = 5. 


If we set 
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P..(a, b) = P.(X1(a) = £1, Xo(b) = £1) 


then 
P..(a, b) = |. sz) es.) 


Now, if we write 
a — (4),2>, a3) 


then 
(ae a) 


so that 
X(@ )\smba)” = sagaia)". 


We may thus replace s4(a)* by si(a™). Hence 


l aw 9 1 +a™~-b 
P,+(a, b) = 7 (Oe ), 54(b))|- = — 
I 7 1+ ab 
P__(a,b) = 5 I(s_(@"), s_(b))|* = —o 


l = . ]—a -b 
P,_(a,b) = Aen lesa De ag 


] o = 1—a™-b 
Poplar Dd ql(s-fa Dp s.(b))}- fa 


Let us write 
Com(a, b) := E, (xv@)x20)). 


We then find 
om (a, b) = a-b. 


At the same time we have the remarkable fact that the value of X,(a~) de- 
termines the value of X2(a) and vice versa. This is the essence of the EPR 
property of the state t which we shall discuss in the next paragraph. This 
property is simply the relation 


P, (xa) = Xo(a)) = 


which is immediate from the formulae for Ps. (we must remember that 
aa = 1), 

Bell now asks the question whether the correlations Pom can be obtained 
from a local hidden variables model. This means there should be a probability 
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space {2 with a probability measure m on it, and corresponding to each unit 
vector a € R* two random variables §(a), n(a) such that (&(a), 7(b)) has 
the same joint distribution as (X | (a), X>(b)). Since X,(a~) = X2(a) with 
probability 1, it follows that we may assume that 


n(a) = E(a’). 
If we define 
Py (a, b) = En(E(a)n(b)) = E,, (E(a™ )E (b~)) 


then Bell’s question is the following: is it possible to choose the probability 
measure m and the correspondence X <—> & such that 


Pom(a, b) = Pv (a, b) 


for all unit vectors a, b € R*? 

Bell answered this in the negative. He discovered that the correlations of 
any set of +1-valued random variables with mean 0 on a classical probability 
space necessarily satisfy some inequalities that are now called the Bell’s 
inequalities. Clearly |§&&| < 1 so that |E,,,(€), §2)| < 1, but much more is 
true. Indeed, if &(i = 1. 2,3) are any three such variables on (&2, m), then 


We eet sie3 4 cagr =. 0 on Q 


because at any point of Q two of the three variables must have the same sign. 
Replacing & by +& and taking expectation values and writing 


Vij = Vii = Vv (a; ), (a; )) 


we have, 
1 + e,eoyi2 + €2€3723 + €3€1¥31 = O 


where €; = +1. Choosing the ¢; appropriately we get 
(B3) Wie Vis) = 1 — ys 


To violate (B3) we take the a; in the plane a; = 0 (so that aj = a; ) with 
angles 7/3 between adjacent ones so that, writing X; = X (a;), 


Coo AX Sa) 2, (X3,X1) = —1/2 
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Then, writing 
Be ow 


we have 
[T12 —Tis| spl | = T23| = 2 


For a detailed and beautiful discussion of the experiments to test Bell’s 
inequalities see the account by Aspect and Grangier [8, 9]. For the original 
papers of Bell and others not only on this question but about the entire theory 
of measurement in quantum theory, see the monumental book of Wheeler- 
Zurek [7b]. 

In conclusion one can now say with a great amount of conviction that the 
hidden variables question has been decisively answered in the negative. There 
is no way, compatible with the current experimental and theoretical structure 
of quantum theory, to obtain the results of quantum theory by any kind of 
reasonable classical hidden variable scheme. 


3.4 EPR 


Can quantum mechanical description of physical reality be considered com- 
plete? This was the title of the famous paper [10a] of Einstein, Podolsky, 
and Rosen. In the initial years after the discovery of quantum mechanics 
and its probabilistic interpretation by Heisenberg, Dirac, Born, Jordan, and 
Schrédinger, Einstein was very much opposed to it and believed that it was 
not internally consistent. In the Solvay congress of 1930 he came up with a 
thought experiment in which in which a box containing radiation was weighed 
before and after the release of a single photon at a pre-assigned time. He then 
claimed that this would allow the determination of the energy of the photon at a 
precise time, thus violating the uncertainty relation between energy and time. 
Bohr refuted this by showing that according to Einstein’s general relativity, 
the gravitational field involved in the weighing of the box will change the 
clock mechanism (that was present to release the photon at a precise time) 
just enough to maintain the uncertainty relation between energy and time. 
After this Einstein accepted the consistency of quantum theory. But in 1935 
he published the paper referred above, in collaboration with Podolsky and 
Rosen, in which they purported to show the incompleteness of the quantum 
mechanical description of reality. 

In this paper they constructed an example on the basis of which they argued 
that the quantum mechanical description of physical reality was necessarily 
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incomplete. Their example was a quantum system consisting of two particles 
which had interacted in the past but are now spatially far apart, and the state 
of the composite system had the following remarkable property: if the value 
of the position or momentum of the first particle is known, the value of the 
position or momentum of the second particle can be predicted with certainty. 
In this manner they argued that without disturbing the second system one 
can measure both the position and momentum of the second particle, a cir- 
cumstance forbidden by quantum mechanics. From this they argued that that 
quantum theory was an incomplete description of what happens in the atomic 
world. 

The impact of this paper on Bohr was remarkable. Here is what Rosenfeld, 
who was collaborating with Bohr at that time had to say. ... The onslaught 
came down upon us like a bolt from the blue. Its effect on Bohr was re- 
markable. We were then in the midst of groping attempts at exploring the 
implications of the fluctuations of charge and current distributions, which 
presented us with riddles of a kind we had not met in electrodynamics. A 
new worry could not come ata less propitious time. Yet, as soon as Bohr had 
heard my report of Einstein's argument, everything else was abandoned: we 
had to clear up such a misunderstanding at once. We should reply by taking 
up the same example and showing the right way to speak about it... . {11a} 

Bohr’s reply was published in the same journal and had the same title 
as the EPR paper. By discussing explicit experimental arrangements Bohr 
argued that the phrase without disturbing the second system had an essential 
ambiguity because of the complementarity phenomena in quantum mechanics 
[1 fajeblb). 

The essence of the EPR example is the following property of the joint 
system: there is a special state of the composite system such that to each 
observable A in the second system one can associate an observable B in 
the first system with the property that if a measurement yields a sharp value 
of B, then one can predict the value of A with certainty. This of course 
is not possible in every state of the composite system. For position and 
momentum measurements of particles in one dimension the EPR state was 
the distribution state (of infinite norm necessarily since we are dealing with 
continuous spectra) 


o= |e i= 1) 
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The integrals have to be interpreted as tempered distributions. If X;, P; arethe 
position and momentum observables of the / in particle, then if a measurement 
of P; is made and yields the value p in the state o, the state collapses to the 


state with wave function 
e! (x1—42)p 


This is an eigenstate of —i0/0x> for the eigenvalue — p. Thus in the state o 


we have 
P, @l=p=18Ph,=-p 


On the other hand, a simple Fourier analysis shows that o has another repre- 
sentation: 


oA i 6b; ® 5,dx 
from which we also have, by a similar argument, 
Xoo = ee 


Thus the knowledge of either the position or momentum of the first particle 
allows us to predict with certainty the position and momentum values of the 
second particle. It is not without interest to explore the ideas of this argument 
more carefully. I do not wish to add to Bohr’s discussion and refer the reader 
to his article [10b, 1!b]. The point roughly is that even though we do not 
seem to disturb the second system, we do disturb the first and hence the 
composite system. The spatial apartness of the two systems is a misnomer; 
any measurement of the first system immediately determines the value of a 
corresponding quantity in the second. The experiments are decisive on this 
point. See [9]. 

It is however of some interest to examine the structure of states like the 
ones that Einstein et all discovered, to characterize them, and preferably, to 
construct them with finite norm. We shall begin by extracting from the above 
discussion a very general definition of states with the characteristic property 
discovered by Einstein et al, at least for states of finite norm. Let us consider 
two systems S; with Hilbert spaces H;(i = 1, 2) and the composite system 
S = S, x S, with Hilbert space H = H; @ H». Leto € H be a unit vector. 


Definition. The state [o ] is said to be an EPR state of S relative to a set LX» of 
bounded observables of Sz if to each A> € > there is a bounded observable 
A, of S; and a borel function g from R to R such that 


3.4 EPR ay 
P (1@ Ap = g(a) | A, @1 =a) =1 


for all a € R; A. is then said to be linked to A). If 2X2 Is the set of all bounded 
observables of S2, we shall say that o is EPR with respect to S>. 


In order to determine when such states exist and to determine all of them, 
it is Convenient to use a special description of H = H,; @ H». We use the 
well-known fact that the elements of H are in natural one-one correspondence 
with conjugate linear operators 


L:H.2— H, 


such that 
Treh i ee oo 


where L'(H; —> H2) is the adjoint of L. The condition on L is equivalent 
to requiring that 

DE ee, ||? = 20 
for some ON basis (e,,) of H2, and then 


TCL ee calls 
n 


for any ON basis (e,) of H2. The corresponding element of H is 
~ Le, © €; 
n 


and is independent of the choice of the (e,,). We thus identify the states of S 
with the operators L. 

The key fact is that if L is any state of S, itis EPR with respect to £2 if and 
only if £2; Cc B,, the commuting algebra of LL, so that B, is the largest set 
with respect to which L is EPR. If now L is to be EPR with respect to S3 it is 
necessary and sufficient that L” L is a scalar; this is the same as saying that 


dim(H>) < oo and L = dim(H2)~'U 


with 
U :H2 — H, an anti-unitary injection . 
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The set of EPR states for S) is then in canonical bijection with the set of 
anti-unitary injections of 1 inside H. 

It is now easy to see that if the EPR state is defined by the anti-unitary 
injection U(H2. <> H,), then, for any observable B Of So, HBU sete 
observable of 5, linked to B by the EPR property. Intuitively, the existence 
of such an imbedding of the observable algebra of 5S inside that of S; is 
reasonable because the propositions of 5: must be found within S|. By 
Wigner’s theorem such an imbedding should be implemented by a symmetry 
H2 <> H,; the technical points that go beyond this heuristic reasoning are that 
H» must be finite dimensional and the symmetry should be anti-unitary'*“, 

Let us now finally assume that both Hilbert spaces are of the same finite 
dimension d. Then we conclude that the states that are EPR for S» are also 
EPR for all observables of 5, also, and are precisely all states of the form 


a=d'P > Ue @e; 


i<j<d 


where U is an anti-unitary isomorphism of H2 with H,, and (e;) is any ON 
basis of H2. The map 


By <> B, = UB,U™' 


then has the property that the value of either of the B; determines the value of 
the other with certainty. The state t in the discussion of Bell’s inequalities is an 
example of such an EPR state where the anti-unitary isomorphism K —> K 
is just complex conjugation. Examples of states such as these were first 
constructed by Bohm [12b]. Thus finally we see that all EPR states have the 
same form as the states that Bohm worked with [12a]. 

We have mentioned already that one can construct the analogues of the 
original Einstein-Podolsky—Rosen states in certain finite quantum systems. 
We take H; = H» = L(G) where G is a finite abelian group of order |G]. 
The scalar product is given by 


] 


9 = TE 


Dd F@eay™” (fee L7@) 


xEG 


For simplicity we consider the antiunitary isomorphism U : f -> f°" of 
Hy» with H). If (e,) is any ON basis of H, we have the representation of the 
corresponding state oy as 
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ee 7=1/2 
oe! se Sie, 


Now we have two ON bases of H2, namely {VIG|5.}reg and {€},.¢ where 
5, 1s the delta function at x and G is the group of characters of G. So 


I 
(*) ay = JIG| ) 6, @ 5, = —= YE" 
U aloes QE 


xeEG 


Let X> be a “position” observable in S$; with distinct values a,(x € G) and 
corresponding eigenstates 5,. For Y; we take the “momentum” observable 
in S> with distinct values b; and eigenstates €. Let X; = UX,U7~', Ve ea 
UY,U~'. For the pair (H,. H2) the EPR relations are summarized by the 
symmetrical formulae 


POX, 1=a,.1@ X2 =4,) =1,, P(¥;@1=h,18% =) =1. 


If we now compare these formulae with those of Einstein et al we see that we 
have here the finite analogue of their state. The original EPR state can then 
be realized as certain renormalized limits of these finite model states, in their 
continuum limit [12]. 


3.5 Transition probabilities in the atom 


The quantum theory developed in the 1920s had been able to solve, at least in 
principle, all problems concerning atoms and molecules. However the rela- 
tionship of this theory to the origins of quantum mechanics was not explained 
at all. In particular, the Einstein—Bohr frequency relations and the probabil- 
ities of the atomic transitions, which were worked out by Einstein based on 
thermodynamic considerations before the development of quantum theory, 
were not part of the new theory. This was not surprising since these features 
flowed from the interaction of the atom with the electromagnetic field, and 
this had not been studied. 

Dirac was the first to treat the problem of deriving these aspects of quan- 
tum theory from the basic principles of quantum mechanics. In a famous and 
beautiful paper [13, 14] written in 1927 (see also the treatment of the Dirac 
theory by von Neumann [4] and Weyl [15]), Dirac treated this problem by 
considering the quantum system consisting of the atom and the quantized ra- 
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diation field, and wrote down a suitable Hamiltonian for the combined system 
on the basis of which he calculated the transition probabilities between the 
energy levels of the atom; the theory led him to the Eisnstein—Bohr frequency 
relations as well as to the Einstein formulae for the transition probabilities. 

Dirac’s theory was quite remarkable and introduced many features that 
were absolutely new. First of all he had to face the fact that the classical ra- 
diation field was a system with infinitely many degrees of freedom and so its 
quantization went well beyond the principles of quantization of systems with 
finite number of degrees of freedom that had been treated hitherto. Moreover 
the quantized radiation field had to be given a particle interpretation right 
from the outset because the Einstein—Bohr relations involved the emission 
and absorption of photons, which meant that the particle aspect of radiation 
had to be at the very heart of the treatment. In Dirac’s theory it came out 
automatically that the classical radiation field, which was a wave field, ac- 
quired particle properties upon quantization, and that the particles obeyed the 
Bose-Einstein statistics, as they should because they were photons. This was 
the wave-particle duality of light, finally emerging out of Dirac’s treatment. 
Dirac’s theory was the first quantum theoretic treatment of a field, although 
it was non relativistic. 

I shall not go into the details of the Dirac theory except to give the formulae 
for the transition probabilities in unit time. In these formulae /(v) is the 
intensity of the radiation field at the frequency v, and k. k’ respectively refer 
to the initial and final states of the atom and v,, v,: their respective frequencies . 
The factors |Q,, |? were originally conjectured by Heisenberg. For absorption 
we have 


4m? : 
Vir > Ve, V = Ver — Vy Py © — 3 |Oe lt (vy 
3h 
and for emission 
Ve > Ver, V = VE — Vy 
An? 9 hv 
Pe ® —>1Q 4 |°| 1) + == 
3h? m2¢3 


It is to be noted that the probability of emission is the sum of two terms one of 
which depends on the intensity of the radiation field exactly as in the case of 
the absorption case, while the other term is independent of the intensity of the 
radiation field. This phenomenon had already been discovered by Einstein 
who called it spontaneous emission. 
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Planck’s radiation formula 


Suppose now that there are many atoms in the radiation field and the entire 
system is in equilibrium. Then, on the average, the number of transitions 
Vg —> vy will be the same as the number of transitions yj —> v,. Let N,. be 
the number of atoms of energy E, = hvy,. Then [15] the equation of balance 
leads to 

Ne, , i) 

Ny T(v) 


where 
a 


= Ba 
On the other hand, by Boltzmann’s theory, 


SEK T —hye/KT 


N, = constant e = constant e 


from which we get 


A tn i(v 
e A(vg M/KT 1 


To) (y= pe vy, > 0) 


This leads to the formula 


hv? 


ON 
(v) agent = 1) 


which is Planck’s radiation law! 


3.6 Feynman path integrals and the Feynman-Kac formula 


In 1942, Richard Feynman, a graduate student in Princeton University, wrote 
a dissertation in physics that contained a novel way of formulating quantum 
mechanics. Because of the war the publication of the results of this thesis was 
delayed and it was not until 1948 that they first appeared in print [17, 14]. His 
method was to start with a classical system and associate to it some sort of an 
integral on the space of classical trajectories using which he gave prescriptions 
for calculating the probabilities of events in the associated quantum system. 
This integral was completely determined by the classical action functional. 
This method, nowadays called the method of path integrals, has since then 
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spread into all aspects of quantum physics, and for many physicists it is the 
preferred mode of thinking and calculation. I shall now give a brief discussion 
of this method, the difficulties in formulating it as a rigorous mathematical 
instrument, and some situations where this can be done. 

The heuristic thinking behind Feynman’s method is as follows. In classical 
mechanics, if we consider two points x, y in configuration space, then in gen- 
eral there is only one path the particle can follow in going from x to y in time 
t. This path is in fact the one that makes the action a minimum, or at least an 
extremum. But in quantum mechanics, because of Heisenberg’s uncertainty 
principle, the position and momentum cannot both be exactly determined at 
any instant of time, so that the particle is smeared out and one can no longer 
say what its path is with certainty; the double slit experiments demonstrate 
this brilliantly. Feynman’s idea was that one should consider all possible 
paths that go from x to y in time t and average over them to get the quantum 
probability of going from x to y in time t, but that this averaging should not be 
with respect to a conventional integral because of the wave—particle duality 
and the uncertainty principle. He had the remarkable insight that each path 
from x to y in time ¢ has an associated complex amplitude, and that it is the 
average of these amplitudes over the space of paths that gives the complex 
amplitude whose modulus squared represents the probability that we want. 
Inspired by a famous paper [16, 14] of Dirac on the Lagrangian in quantum 
mechanics, Feynman then made his final and most beautiful discovery that 
the amplitude associated to a path u is proportional to 


ei /A) Stu] 


where S is the classical action. Using this more or less as a hypothesis he 
showed how to get all the basic results of quantum mechanics. His method, 
based on the Lagrangian rather than the Hamiltonian, proved to be very supple 
and would eventually be carried over to the context of quantum fields. For 
Feynman’s beautiful discussions see [17, 18]. 

The amplitude for getting from x to y in time f is called the propagator 
and is denoted by K,(x, y). One way to think of this is to regard the unitary 
operator U, that takes one from time r = 0 to time ¢ as an integral operator 
with kernel K,(x, y). Feynman’s heuristic argument leads to the following 
formula, called the Feynman path integral formula, for the propagator: 
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where Du is some kind of a “Lebesgue” measure on the space of paths uw such 
that (0) =wG@y=y. 

Unfortunately it is impossible to find a candidate for the “Lebesgue mea- 
sure” Du that takes the place of summation in this formula. The attempts 
going back to Feynman to define the integral representation of the propaga- 
tor by a limiting process similar to the ones used in the theory of Riemann 
integration are not convincing from the mathematical point of view because 
one cannot define a theory of integration that way. The integration theory 
has to be defined first, and then one can use such approximation schemes to 
compute a well defined integral, but not the other way around. Nevertheless 
this method has proved very popular among physicists. From the mathe- 
matician’s point of view the method itself has to be thought of as a formal 
instrument with great suggestive power, which can often lead to predictions 
that can be verified by other methods. 

This brief discussion does not do justice to the beauty and novelty of 
Feynman’s ideas. In[{18] he gives a brilliant discussion of the familiar double- 
slit experiment with electrons passing through two holes and falling on a 
screen, and how one is led to the uncertainty principle of Heisenberg. One 
can perform the experiment with very weak streams of electrons and ask what 
are the probabilities P, P}, P: where P is the probability that the electron 
arrives at a position x on the screen when both holes are open, and P; 1s the 
probability when only hole i is open (i = 1, 2); these are functions of x. If 
these events have the meaning they have classically, we would have 


P=P\+P, 
but experimentally we find that 
Pox Pi --P5, 


The interpretation is that when both holes are open, we cannot say that the 
electron goes through one hole or the other in order to arrive at the screen! 
To avoid making this interpretation the answer is to make an experiment to 
make sure that the electron does indeed pass through one or the other hole, 
by inserting a light source at each hole which would scatter the electron when 
it passes through that hole. The results are unambiguous: every electron that 
arrives at the screen has passed through one or the other hole! In Feynman’s 
heuristic picture, the events have probability amplitudes (which are complex 
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vectors) &. @). $2 and one always has @ = @; + @2 but for the probabilities 
we may have P = |@|? # P; + P2 = |dil? + |@2|°. I give here an extended 
quotation from [18]: 

_.. We thus see that any physical agency designed to determine through which hole 


the electron passes, must produce, lest we have a paradox, enough disturbance to 
alter the distribution. . . 

It was first noticed by Heisenberg, and stated in his uncertainty principle, that 
the consistency of the then new mechanics required a limitation to the subtlety to 
which experiments could be performed. In our case it says that an attempt to design 
apparatus to determine through which hole the electron has passed and delicate 
enough so as not to deflect the electron sufficiently to destroy the interference 
pattern, must fail. It is clear that the consistency of quantum mechanics requires 
that it must be a general statement involving all the agencies of the physical world 
which might be used to determine through which hole the electron passes. The 
world cannot be half quantum mechanical and half classical. No exception to the 
uncertainty principle has been discovered... . 


... Tosummarize then: the probability of an event (in an ideal experiment where 
there are no uncertain external disturbances)is the absolute square of a complex 
quantity called the probability amplitude. When the event can occur in several 
alternative ways the probability amplitude is the sum of the probability amplitude 
for each alternative considered separately. 


If an experiment capable of determining which alternative is actually taken 
is performed the interference is lost and the probability becomes the sum of the 
probability for each alternative. 


The precise mathematical formulation of the Fenyman path integral, at 
least in the case of particles moving in euclidean space under some otential 
field (and later, for some quantum fields) did not come till the 1950s and was 
a consequence of some deep ideas of Mark Kac [22b]. 


Feynman-Kac formula 


In the 1950’s the mathematician Mark Kac noticed that if we change in the 
Schrodinger equation the time variable ¢ to it where i = (—1)!/? then it 
becomes a diffusion equation which governs the transition probabilities of a 
Markov process. To any stochastic process one can always associate a prob- 
ability measure on the space of paths of the process and so one has a genuine 
path integral for computing the propagator of this process. This path integral 
can then be analytically continued in the time variable and the Feynman prop- 
agator can be obtained by this analytic continuation. The formula that Kac 
obtained by pursuing this idea is known as the Fevnman—Kac formula and is 
one of the landmarks of rigorous path integral theory [19, 20]. I shall now 
discuss briefly this formula. In particular the derivation of this formula will 
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“explain” at least partially Feynman’s profound insight that the functional 
inside the integration for the propagator is proportional to the exponential of 
the action. 

To motivate the formula let us go to the finite model where a particle 
moves in a finite configuration space X, and the Hilbert space is H, the space 
of complex functions on X.' Let us assume that the Hamiltonian of the 
quantum system is of the form 


H=H-+V, Ho = —Ay 


where Hy = —Ay is the “free” part and V is a “potential”, V being a real 
function on X. The notation is clearly set up to highlight the analogy with 
the quantum mechanics of a particle in R‘ moving in the field of a potential 
function V; there the Hamiltonian is —A, + V where A, is the Laplacian in 
R‘. The dynamical group is U, = e''” and we wish to get a path integral 
representation for the kernel K,(x, y) = (U,6,,6,)(x, y € X). Because of 
the finite dimensionality of H the complex dynamical group U- = e*#(z € C) 
makes sense and so one can define 7, = e 7!” 
case this argument does not apply but if we assume that H > 0 in the spectral 
sense, then U- is defined for ‘i(z) < 0 without any problem. However, unlike 
the finite case 7; is defined only for t > 0; semigroups rather than groups in 


. In the infinite dimensional 


the infinite dimensional case. 
The basic assumption now is that the free part Ho has the following prop- 
erties: 
aoa |) 
2. If f is areal function on X and xo is a point at which f is minimum, then 
(Ax f) (xo) 2 0 


The second condition is known as the minimum value principle for A. This 
has a clear analogy with the Laplacian. Let 


at, y) a (Axdy, dx), A = (a(x, y))x.vex 


so that A is the matrix of Ay inthe basis (4, ). Itcan be shown that these condi- 
tions are equivalent to saying that the semigroup e~'/"" = e'®* is Markovian, 


i.e., the function 


p(x, y) = (e'**dy, 5x) (t > 0) 


! I owe this discussion to my friend Varadhan. 
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is the transition probability of a Markov particle being at y at time f if at time 
0 it was at x. This is the same as requiring that 


p(x,y)20 Yo ptr. y)=1 


The above conditions are also equivalent to the following. 


il, ae. y= 0 
2. (2) a(x, y) => O (x # y) (=> a(x, x) = 9) 


One can then build a family 
PAGER) 


of genuine probability measures on the space F of all step functions w on 
the positive time axis [0, 00) which are right continuous and take values in X 
such that the probability 


Pe(@| Oh) = 21, @() —29,..., 00) — 28) 
is equal to 


Pit, 41) Pen 1p 2)'- -~ Pig —ty_, COND 


We shall not go into the construction of these measures, although even in this 
simple situation this is somewhat nontrivial. 
Given the measures P, we can define the conditional probability measures 
Pi,x,y by 
Poyh)=-Pl@ek | og} 


We then have the Feynman—Kac formula in this situation: 
t a 
Kus y= fe BVO: dP, y - pyCx,y) 
F 


The original propagator itself is obtained by analytic continuation of the 
imaginary time propagator constructed above. 

To prove this we use the so-called Trotter product formula which is trivial in 
the finite dimensional case but requires more care in the infinite dimensional 
case: 


mie . - —+V N 
et (Ax V) = lim (4x @ acs) 
N—oo 
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A simple calculation shows that the matrix of 


(ei eae)" 


= Prin, 21) Dep (21, 22) .-- Pryw (Zn, ye NIV ED+ FV en) | 
where the sum is over all possible z,, 22. ..., zy_,;. Now the expression 
Pepn (X, 21) Ptyw (Z1, 22) -- - Prsw(ZN-1, ZN) 
can be written as 
Pee OO Zig O(N — 1)1/N) = zy) ~ p(x, y) 


and so the matrix of 
(ee 


becomes 


—(t/N) Dj <p<y Viwlrt/N 
e UN) Yizren VON GP. (Ww) + p(X, Y) 


The exponent is a Riemann sum which, when N — oo, goes to 


t 
/ V(w(s))ds 
0 
and so the whole expression has the limit 
[le Brett AP, se) a.) 
ua 


which is the formula of Feynman—Kac in this very special case. 
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Let us now go over from this “toy” situation to the continuous case when 


the configuration space is R. The Hamiltonian is 


Wee 


H = -~—~ 
2 dx? 


+ V(x) 


where we assume that the potential is > 0. It is known that if V is continuous 
this is an essentially self adjoint operator on the space of smooth functions 
with compact support and is > 0. Hence we can speak of the semigroup 
e 4 (¢ > 0). By analogy with what we did in the finite case we must asso- 
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ciate a probability measure with the semigroup e“/ 2)d*/dx* (¢ > Q). We can 
use Fourier transform theory to give sense to this operator. At the Fourier 
transform level d2/dx2 is multiplication by —E? and so e/2)” /4x* ig multipli- 
cation by e~/2"; in the original space it is then convolution by the inverse 
Fourier transform of e~/2® which is the gaussian density 


] 2 
e* /2t 
2m t 


So e(/2)4*/4* (¢ > 0) is an integral operator with the gaussian kernel 


at en 
e y)*/2t 


g(x,y) = J/ant 


This is the transition probability density of the well known Wiener process 
which models brownian motion. The corresponding probability measures 
W, defined on the space C[0, 00) of all continuous functions on [0, oo) and 
model brownian motion starting from x at time 0. It is Markovian and in fact 
has the property of having independent increments, 1.e., the increments 


X (ty) yw Kyps toe a) Son 


are independent if the intervals [s), t)],...[S,,t,] are disjoint. It is not sur- 
prising that the free part of the Hamiltonian corresponds to brownian motion 
because in the approximating finite case it is a random walk and typically the 
random walks have the browninan motion as their limits. The final formula 
of Feynman—Kac is 


t : 
(FK) Kit(x, y) = il @ 10 VG) We Ry) 
C{0,1] 

Here W,,,,, are the conditional brownian motion measures that start from x 
at time 0 and reach y at time tf. The meaning of the propagator is as follows: 
gl (t/2)d? /dx?—V(x)] 
is an integral operator on L*(R) with kernel K;,(x, y). Formally we get this 

by replacing 
ei InU/2Dx8) Hy 


by the (conditional) Wiener measures in Feynman’s formula. The real time 
propagator is obtained by analytic continuation [22c]. 
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In the F-K formula the potential is separated from the free part, and the 
integration is determined by a measure that is independent of the potential 
and involves only the free part. None of these is obvious from the Feynman 
heuristics and so the Feynman—Kac formula is indeed a great advance beyond 
the original formula of Feynman. 

How can this formula be used to get physically interesting results? One 
of the best situations where this formula can be applied is when we wish to 
obtain spectral information on the energy H. For this it makes no difference 
whether we consider e''” or e~'". For a thorough treatment of many such 
applications see [19, 20]. One of the striking results is that if an atom has a 
ground state it is unique. 

There are many generalizations of this formula. For systems with internal 
symmetry where the wave function has several components and the potentials 
are hermitian matrix-valued, see [21a, 21b]. There are however even more 
fundamental directions in which the theory can be extended, for instance, 
working in a Riemannian manifold where there is a natural Brownian motion 
corresponding to the Laplace-Beltrami operator. For homogeneous spaces 
such processes go back to the work of Yosida, but the probabilistic theory is 
governed by the great insights of Ito, and, after him, of Stroock and Varadhan. 
The theory has applications that reach deep into analysis and has allowed 
people (Bismut, Gestler, etc) to obtain proofs of the Atiyah-Singer index 
theorem. The literature is huge. It appears that the probabilistic method, 
by being able to follow what happens when one moves along a path of the 
Brownian motion, is able to really reach deep into the manifold. It is perhaps 
not an exaggeration to view it as a theory of the quantized manifold. 

Kac himself considers the FK-formula as one of his most striking achieve- 
ments. Here is his assessment taken from his autobiography [22a]. 


In its various guises the F-K formula is ubiquitous throughout much of quantum 
physics on the one hand and probability theory on the other. It is probably safe to 
say that I am better and more widely known for being the K in the F-K formula 
than for anything else I have done during my scientific career. 


But first a little history. At the end of the war Hans Bethe returned to Cornell 
from Los Alamos and brought with him a group of brilliant young experimental 
and theoretical physicists. Among them was Richard Feynman who, to no one’s 
surprise, went on to become a leading physicist of our day. It must have been in 
the spring of 1947 that Feynman gave a talk at the Cornell Physics Colloquium 
based on some material from his 1942 Ph. D dissertation, which had not yet been 


published. 
A fundamental concept of quantum mechanics is a quantity called the propaga- 


tor, and the standard way of finding it (in the nonrelativistic case) is by solving the 
Schrédinger equation. Feynman found another way based on what became known 
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as the Feynman path integral or “the sum over histories” which Dyson alludes to 
in his book Disturbing the universe. 

During his lecture Feynman sketched the derivation of his formula and I was 
struck by the similarity of his steps to those [ had encountered in my work, Ina 
few days | had my version of the formula, although it took some ume to complete 
a rigorous proof. My formula connected solutions of certain differential equations 


closely related to the Schrédinger equation with Wiener integrals. 


It is only fair to say that | had Wiener’s shoulders to stand on. Feynman, as in 
everything else he has done, stood on his own, a trick of intellectual contortion that 
he alone is capable of. 


I find Feynman’s formula to be very beautiful. It connects the quantum mechan- 
ical propagator, which is a twentieth century concept. with the classical mechanics 


of Newton and Lagrange, in a uniquely compelling way... . 
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Chapter 4 
Super geometry* 


Superalgebras serve as the basis for the construction of 
geometric objects, such as superprojective spaces and 
supermanifolds. The theory has applications to 
supergravitation in physics and it is studied by 
supermathematicians. 

Igor Shafarevich 


Supersymmetry was discovered by physicists around 1970. At first it was just a device for 
treating the bosonic and fermione aspects of quantum statistical mechanics and quantum 
field theory on an equal footing. Eventually, due to the remarkable ideas and contributions 
of many physicists and mathematicians, supersymmetry came to be understood as the 
symmetry of a new kind of geometrical object, namely, a super manifold. The super 
manifolds are objects in super geometry. which is a deep generalization of conventional 
differential and algebraic geometry. The symmetries of super manifolds form super Lie 


groups. 


4.1. The evolution of classical geometry 
4.2. Super geometry 

4.3. The theory of super manifolds 

4.4. Super Lie groups 


4.1 The evolution of classical geometry 


It is common knowledge that the idea of symmetry and the role of group the- 
ory in describing it go back to very ancient times. Indeed, the remote origins 
of the subject had been discussed long ago in Hermann Weyl’s beautiful book 


* This essay and the next are based on various articles and lectures on supersymmetry | 
have given on many an occasion, especially a series of lectures in the Courant Institute 
in New York, in 2002, and in Oporto, Portugal. in 2006. Iam extremely grateful to the 
large number of people with whom I have discussed the issues that form the core of these 
essays, and from whom I learnt most of the things I talk about. | would like to mention 
many notably Luigi Balduzzi, Claudio Carmeli, Gianni Cassinelli, Lauren Caston, Pierre 
Deligne, Sergio Ferrara, Rita Fioresi, Marian Lledo, Albert Schwarz. and Alessandro 


Toigo, to mention just a few. See also [1]. 
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Symmetry. However it was only in the twentieth century that it became possi- 
ble to erect a fully satisfactory mathematical theory of symmetry, namely the 
theory of Lie groups and their representations, that was adequate enough for 
all the applications. This theory, based on differential and algebraic geome- 
try, was one of the major achievements of mathematics in the last century . 
But, in the 1970’s, the physicists, driven by their desire to explain the main 
features of the physical world at the most fundamental level, began to enter- 
tain the idea that the singularities in their descriptions of nature arose at least 
partly from the inadequacy of the usual models of spacetime to furnish the 
background for events taking place in regions of ultra-small dimensions, not 
to mention gravity where spacetime itself is part of what is to be described. 
Proceeding in this direction they came up with new models for spacetime and 
its symmetries. One of these is based on the conception that the description 
of spacetime should be based not only on the usual coordinates but on ad- 
ditional Grassmann coordinates, reflecting the fermionic structure of matter. 
This idea became the genesis of a new geometry, namely, super geometry. 
One has to thank the physicists for their daring and imagination to have come 
up with such a generalization. The effort to build a proper mathematical 
foundation for the discoveries of the physicists led to the development of an 
entirely new part of mathematics, namely the theory of super manifolds and 
super Lie groups. This is a development of geometry beyond its conventional 
limits and it is not clear at this time in what directions it will continue in the 
future. Certainly the happenings in the world of elementary particles and 
their fields will have a lot to say about the future evolution of geometry itself. 
I feel therefore that it is natural to start this essay with a brief overview of the 
evolution of the idea of space and geometry, starting with ancient times and 
leading gradually to superspace and super geometry. 

From the historical point of view it is remarkable that already in 1854, 
Riemann, in his Gottingen inaugural address [2a, 2b, 2c], had speculated on 
the structure of space in its microscopic parts, and raised the possibility that 
the manifold structure may not be present in its infinitely small parts. He also 
advocated the point of view that we have to look for physics to tell us what 
this micro-structre should be. Here is what he wrote: 


Now it seems that the empirical notions on which the metric determinations of 
Space are based, the concept of a solid body and a light ray, lose their validity 


For a look at the development of Lie groups and their representations I refer the reader to 


a review I published in 2008 [2j]. It gives a bird’s eye view of the historical development 
of Lie theory. 
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in the infinitely small; it is therefore quite definitely conceivable that the metric 
relations of Space in the infinitely small do not conform to the hypotheses of 
geometry; and in fact, one ought to assume this as soon as it permits a simpler way 
of explaining phenomena. 


eee An answer to these questions can be found only by starting from that 
conception of phenomena which has hitherto been approved by experience, for 
which Newton laid the foundation, and gradually modifying it under the compul- 
sion of facts which cannot be explained by it. Investigations like the one just made, 
which begin from general concepts, can serve only to ensure that this work is not 
hindered by too restricted concepts, and that the progress in comprehending the 
connection of things is not obstructed by traditional prejudices. 


Riemann’s prophecy already started unfolding when Einstein discovered that 
spacetime, rather than space, is the invariant object attached to experience, 
and that its geometry is governed by a metric (in a pseudo Riemannian sense) 
of signature (+ — — —). If one thinks of all the various ideas about the 
micro-structure of spacetime that have come up in the meantime, one can 
only marvel at Riemann’s intuition and understanding. 

In order to put these developments in proper perspective we must begin by 
turning to the evolution of classical geometry which itself is tied intimately to 
the age-old problem of the structure of the space we live in. Indeed, for close 
to two thousand years, starting with Euclidean geometry, axiomatic geome- 
try was thought to describe the geometry of physical space. Consequently 
attempts to build alternate geometries were viewed as heretical, forcing even 
such a titanic figure as Gauss to keep his investigations in noneuclidean ge- 
ometry to himself, restricting himself to ironic comments in his letters to his 
scientific friends. However the discoveries of Bolyai and Lobachevsky, to- 
gether with the posthumous publication of the work of Gauss, clearly showed 
that the problem of what type of geometry governs physical space is quite 
different from the problem of what types of geometry exist under different 
systems of axioms. The usual parallel axiom leads to euclidean geometry, 
while a different type of parallelism leads to the noneuclidean geometry of 
Bolyai and Lobachevsky. Nevertheless the intuition coming from our experi- 
ence of the properties of the space we live in played a profound and mysterious 
role in determining alternative possibilities for what space can look like. 

The modern era of geometry may be said to have started with Riemann’s 
discovery of what we now call Riemannian geometry. One of Riemann’s 
greatest contributions was the realization that the geometry of a space must 
be built from its smallest parts. These small parts can be described by local 
coordinates X\, X2,...,Xn; and the infinitesimal distance, called the metric, 
between points with coordinates (x), ..., Xn) andyx) =P dx). ...2 t, + dx,) 
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is given by a Pythagorean expression 


ys Pi, 4. +, en aa ny 
‘j 


where Pythagorean means that the matrix (g;;) is positive definite everywhere. 
Later, with the advent of Einstein’s work and Minkowski’s famous expositions 
of it, it was recognized that one can also have pseudo Riemannian metrics 
which are defined exactly as above but with the demand that the matrix (g;;) 
have a fixed signature everywhere; if the signature is (+ — ... —) the 
geometry is called Minkowskian. It became clear from Riemann’s work that 
before one studies all the metrics that can be imposed on a space, one must 
first decide what sort of a space one has. In modern terms, one has to first set 
up the differential topology that the space ought to have, and then examine 
the possible metrics. For instance in studying compact Riemann surfaces, we 
classify the differential topology in terms of the genus, and then, for a given 
genus, study the complex structures that can be imposed on a surface of a 
given genus. 

Riemann’s lecture was published only after his death. It was a very difficult 
work to penetrate; for instance, it contained just a single formula. It intro- 
duced a cascade of revolutionary and profound ideas. It took almost a half 
century before Riemann’s ideas were unraveled, by the efforts of differential 
geometers such as Bianchi, Christoffel, Levi-Civita, Ricci, and many others. 
This pioneering effort to understand Riemann was followed by the discov- 
eries of Elie Cartan, and Hermann Weyl on global differential geometry and 
Lie theory. The view point of tensors dramatically changed the comprehen- 
sion of natural laws, leading eventually to Einstein’s great theory of universal 
gravitation as a manifestation of the curvature of a spaetime which is a global 
Minkowskian manifold of signature (+ — — —). 

Riemann’s method of building spaces began to be be understood better 
when Hermann Weyl published his ground-breaking 1913 classic [2d]. It 
was continued by Hassler Whitney in his great paper [2e] on general dif- 
ferential manifolds which started a revolution in differential topology. The 
idea was to obtain the global space by g/uing together small parts which can 
be completely described by local coordinates, together with a set of rules 
that tell us what coordinate transformations should connect these local co- 
ordinates on overlapping regions. This view emphasized the fact that many 
spaces would look the same locally, but would differ globally because of the 


4.1 The evolution of classical geometry 97 


different possibilities in the gluing process. This approach allowed one to 
treat smooth, real and complex analytic manifolds in the same framework. It 
was using such a framework that Chevalley was able to give the first global 
treatment of Lie groups in his epoch-making book’. It was Chevalley’s book 
that made the global view accessible to the general mathematical public [2f]. 

Then, in 1954, Serre [2g] showed how this method of constructing and 
studying manifolds by gluing local pieces could be extended to algebraic 
varieties defined over fields of any characteristic. This idea was generalized 
by Grothendieck and culminated in his theory of schemes, which apparently 
began as a series of remarks on Serre’s paper. Grothendieck’s ideas were 
developed in a monumental series of monographs published by the IHES, 
starting from [2h]. Serre’s framework was already sufficient for Borel to 
begin the theory of algebraic groups over fields of any characteristic and then 
for Chevalley to develop the theory of semisimple Lie groups over arbitrary 
fields, culminating in his epoch-making classification of semisimple groups 
and their representations over arbitrary fields [2i]. 

Let us now return to the description of manifolds. In the first place coordi- 
nates are the results of measurements which are necessarily only over small 
regions because any measuring apparatus has only a small finite extent. In 
this discussion we must remember that in physical applications words like 
local and global have meanings depending on the context and must not be 
thought of in everyday terms. For instance local or small may refer to dis- 
tances measured in magnitudes of the order of 10~'°cm in elementary particle 
experiments but the scale may change to millions of light years in describing 
the local structure of the universe. Moreover, although measuring originally 
referred to distances, eventually it came to be understood as a measurement 
of some aspects of the proximity properties of space that may be expressed 
numerically, or even measuring things like the pressure or temperature of 
some gas distributed throughout a star or a galaxy. Thus one replaces the 
idea of coordinates by the ring of all numerical quantities that are measured, 
the local ring. And space was to be thought of as a gluing together of these 
local rings. The mathematical tool to formulate this gluing is that of a sheaf 
of rings of numerical functions. 

But then Hilbert and Gel’fand realized that for the most important types 
of local rings of functions, the points at which the functions of these rings 


* Chevalley’s book appeared in the famous Princeton Series with a dedication to Elie 
Cartan and Hermann Weyl. I still remember the blurb on the cover jacket saying that “the 
reader need no longer be afraid of shrinking neighborhoods of the identity element! ! 
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are defined can be obtained from the rings themselves by looking at all the 
maps of these rings into the number system we are working with. This is 
usually described by saying that the domain of the elements of the ring is the 
spectrum of the ring. For instance the homomorphisms of the polynomial 
ring C[x1, ..., x] into C are precisely all the evaluation maps at points of ee 
(Hilbert), while the homomorphisms of the ring C[X ] of continuous functions 
on a compact Hausdorff space X are given by the evaluations at the points 
of X (Gel’fand). This made it clear that the function rings attached to the 
various regions of space already determine the local structure of space. The 
rings however have to be mutually compatible: when we take a ring associated 
to a region U and restrict the elements of it to an subregion V, we must get 
elements of the ring associated to V. This is expressed by saying that we 
have a sheaf of rings. The gluing must therefore match the sheaves on the 
individual parts that are being glued together. 

There is another remarkable aspect of this point of view: if we have ele- 
ments f},..., f, of the ring attached to the region U, we can perform opera- 
tions on these numerical functions and obtain new functions. Certainly, as the 
functions come from a ring, all algebraic operations are to be permitted. But 
in differential geometry where analysis is a great tool, we must allow arbitrary 
differentiable or analytic functions of fi, ..., f,. Thus by changing the scope 
of these operations we obtain the various kinds of geometry: algebraic, dif- 
ferential, analytic, holomorphic, and so on. By changing the number system 
we obtain real, complex, or p-adic manifolds. This is the modern perspective 
in geometry that has resulted in almost unlimited freedom in building models 
of space. 

This perspective reached its climax with Grothendieck’s theory of schemes; 
for a systematic exposition of the basics see [3a]. Proceeding by an intuition 
that was unlike that of anyone who preceded him, Grothendieck realized that 
a general theory that would be powerful enough to describe and solve all 
problems of algebraic geometry must be built on spaces where the local rings 
are completely arbitrary commutative rings. They would not be rings of nu- 
merical functions in general although they would still be formally attached to 
the regions of space and they would still be compatible; but now the restric- 
tion maps from a region to a subregion have to be part of the data defining 
the space. To recover some of the intuition lost in the process of making this 
generalization, the elements of the rings may be viewed as functions defined 
on the spectra of the rings. Thus the objects of study are topological spaces on 
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which we have a sheaf of commutative rings, the spaces themselves appearing 
as the spectra of these rings. 

However, for an arbitrary commutative ring, the knowledge coming from 
representing elements of the ring as functions on the spectrum of the ring, Is 
not complete because it loses information; there may be elements of the ring 
which give the value 0 at all points of the spectrum. For instance nilpotent 
elements have this property: indeed, if x” = 0 and / is a homomorphism of 
the ring into a field, we have h(x)" = 0 so that h(x) = 0. The presence of 
such elements in the local rings, as well as the desire to work with a// number 
systems, make the theory of schemes technically very difficult to develop, 
but very rewarding once this is done, because it leads to the deepest results. 
The scheme thus became the most general type of space that can be created 
with the property that it can be built from its smallest parts which are spectra 
of arbitrary commutative rings. In the evolution of the concept of space the 
scheme thus represented the ultimate stage. It was the great achievement of 
Grothendieck and his successors that they were able to formulate, and, more 
often than not, solve, the fundamental questions of algebraic geometry, in 
the framework of schemes. Moreover, the Janguage of schemes effected a 
surprising and profound unification of number theory with geometry. 

Unbelievably, Grothendieck took the notion of space even farther. He 
realized from the general theory of schemes he had built that the underlying 
structure of a space was completely determined by the category of sheaves 
defined on the scheme. This led to his insight that the notion of space itself as 
consisting of points and their proximity properties (topology) is dispensable; 
one can replace it with the category whose objects represent the open sets and 
whose maps are the possible fundamental maps we allow in this category. 
The sheaves, viewed as functors from this category, form the ropos. He then 
discovered the étale toposes of schemes that would enable him to connect the 
world of topology and differential geometry, the continuous world, with the 
world of number theory, the discrete world. I do not discuss this here since 
super toposes have not yet been used in super geometry. For a first glimpse 
of a topos see Allyn Jackson (p. 1052 of [3b]). 

One can say that as long as the quest was to describe a commutative world, 
the notion of schemes and even toposes may be regarded as the end of the 
evolution. Riemann’s prophetic warning against assuming that space is a 
manifold may thus be viewed as a cautionary reminder that even the schemes 
or toposes may be inadequate to describe the physical world as we acquire 
more powerful technical tools to probe into ever smaller regions of it. Already 
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quantum mechanics revealed that the algebras of physical quantities that come 
up in quantum theory are noncommutative. In supersymmetry we see the 
first indications that it may be worthwhile to relate these quantum algebras 
to spactime, and see if there are noncommutative aspects to spacetime itself. 


4.2 Super geometry 


To trace the beginnings of supersymmetry let us go to the one of the main 
features of quantum theory, namely, that elementary particles have an internal 
characteristic, their spin. The notion of spin, which is electromagnetic in its 
origin, allows one to make a crucial distinction between particles, namely, 
whether their spin is an integer or a half-integer. In the integer case they are 
called bosons while in the half-integer case they are called fermions. The 
distinction between bosons and fermions becomes crucial when one treats 
a system of identical particles. This may be called the spin-statistics con- 
nection. For an assembly of bosons there is no restriction as to whether 
they can all occupy the same state; indeed, the construction of lasers exploits 
this very possibility. Bosons are said to obey the Bose-Einstein Statistics. 
On the other hand, the fermions obey Pauli’s exclusion principle: no two 
fermions can occupy the same quantum state. As a result properties of matter 
differ dramatically in macroscopic phenomena depending on whether they 
are aggregates of bosons or fermions. The fact that fermions exclude each 
other has many consequences in phenomena of ultra low-temperature physics 
such as super fluidity and super conductivity. Fermions are said to obey the 
Fermi-Dirac statistics. 

The spin-statistics connection is a deep consequence of fundamental results 
on the structure of quantum field theories. If H is the Hilbert space of a single 
particle, the Hilbert space of a system of N such particles is S‘ (H), the space 
of symmetric tensors of rank N, if the particle is a boson; and A’ (H), the 
space of skew symmetric or alternating tensors of rank N, if the particle is a 
fermion. The equation 

ene =O, 


so characteristic of exterior or Grassmann algebra, is the codification of the 
statement that two fermions cannot both be in the state defined by the same 
unit vector e. However in quantum field theory, it is possible that pairs of 
fermions give rise to bosons and vice versa; for instance an electron and 
positron may annihilate each other and produce radiation, or a photon may 
decompose into an electron—positron pair. The Hilbert space of quantum field 
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theory thus admits states in which there are particles of either type. Thus the 
Hilbert space of one particle states is an orthogonal direct sum H = Ho® HH; 
here the vectors of Ho (resp. 7) define the states where the particle is a 
boson (resp. fermion). Supersymmetries then arise as transformations that 
exchange the bosonic and fermionic states. 

In mathematical terms one may say that the Hilbert spaces of quantum field 
theory are Zj-graded. The attempt to have a unified way of treating bosons 
and fermions on the same footing soon led to linear algebra over Zo-graded, 
or, super vector spaces. A super vector space is a vector space V with a 
distinguished grading V = Vo @ V, by the two element group {0, 1} = Z/22Z. 
V) and V, are called respectively the odd and even subspaces. They form a 
category admitting a tensor product, with specific rules, forming what is called 
a tensor category; | shall discuss this in a little more detail later. The result 
of this exercise is super linear algebra. 

However the essential point of departure came when physicists, notably 
Salam and Strathdee [4a, 4b], realized that one has to construct a graded 
generalization of classical geometry itself so that, after quantization, one 
obtains the graded Hilbert spaces so characteristic of quantum field theory 
in a natural manner. Now the manifolds in classical geometry are studied 
using local coordinates. The new ideas then led to the creation of a gen- 
eralization of classical geometry in which the usual local coordinates were 
supplemented by a set of Grassmann coordinates. The heuristic meaning 
of the Grassman coordinates was that they encoded the fermionic aspects of 
matter, in particular the Pauli exclusion principle that is at the basis of all 
properties of matter in the bulk. The automorphisms of such super manifolds 
were to be viewed as the supersymmetries, which form super Lie groups. 
The super manifolds, on quantization, led to the super Hilbert spaces and to 
the Fermi-Bose symmetries of quantum field theory. I must mention that the 
pioneering efforts of the (late) mathematician F. A. Berezin had an important 
role in the evolution of these ideas and in the creation of supersymmetry as a 
distinctive mathematical discipline. 

With these works the following became clear: the world itself is non- 
commutative, more precisely, supercommutative, and the quantum world of 
noncommutative algebras is a natural by-product of this super-commutative 
world. 

* The mathematician and physicist are going even further and looking at worlds which 


are even more noncommutative. I refer to the noncommutative geometry of people like 
Connes, Manin, Schwartz, Witten, Woronowicz, and others. I do not go into this terrain(!) 
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Because of the novelty and conceptual difficulty of this type of general- 
ization of classical geometry, the notion of supersymmetry was formulated 
at first only infinitesimally. The infinitesimal supersymmetries form a super 
Lie algebra, which is a generalization, in the super linear world, of a Lie 
algebra. Indeed, the concept of a super Lie algebra itself was born in the 
attempts by physicists to formulate Fermi-Bose symmetry. Since a super 
Lie algebra is a linear object it was not difficult to formulate its definition 
precisely and analyze its properties. Gradually, the concept of supersymme- 
try was invested with a more fundamental meaning, and the idea took hold 
that supersymmetry was present in the early universe, only to be broken later 
when the universe started cooling. As a part of this evolution the goals of 
the physical side of the theory also became clear: to work out electrody- 
namics, and more generally, Yang-Mills theories, and Einstein gravity, under 
the umbrella of supersymmetry. For instance, Wess and Zumino worked out 
the supersymmetric version of electrodynamics, while Ferrara and Zumino 
treated nonableian gauge theories in the presence of supersymmetry. Also one 
now has a minimal supersymmetric standard model, and its predictions on the 
masses of elementary particles are in the list of things to be tested when the 
new collider at CERN starts operating. Finally, supersymmetry was enlarged 
to cover Einstein gravity by the ideas of Freedman, van Nieuwenhuizen, and 
Ferrara, as well as Deser and Zumino [6c]. 

The structure of these supersymmetric theories is such that they offer many 
advantages over conventional theories: softer divergences, reduction of possi- 
ble Lagrangians, and so on. Although the global (as opposed to infinitesimal) 
formulation of supersymmetry was implicit in the treatments of the physicists, 
it was not fully revealed because of the difficulties in capturing the notion of 
super Lie groups (as opposed to the technically much simpler notion of su- 
per Lie algebras). In order to understand properly not only the infinitesimal 
but also the global geometric nature of super manifolds and supersymme- 
try at a fundamental level, it is therefore necessary to look more closely at 
the nature of super geometry as a generalization of classical differential and 
algebraic geometry. It turns out that what we need is a generalization of 
Grothendieck’s theory of schemes in which the local rings are supercommu- 
tative and have a special local structure, namely as a Grassmann algebra over 
commutative local rings. The presence of the Grassmann variables, which 
are nilpotent and invisible, has the consequence that the mathematics of the 
theory of super manitolds can be developed cleanly only by using the methods 
and techniques of the theory of schemes. This method, called the functorial 
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method by mathematicians, is in fact extremely transparent, and is the one 
that mirrors completely all the calculations that the physicists make, so much 
so that one can have the advantage of working in the informal way that is 
most characteristic of physics, and yet lose almost nothing in mathematical 
rigor. 


The Dirac mode 


I begin with a quotation from Ferrara, one of the pioneers of the subject, from 
his introduction to [6c}: 


Supersymmetry, i.e., Fermi-Bose symmetry, is one of the most peculiar discoveries 
in the history of physics. The peculiarity lies in the fact that although a tremendous 
theoretical effort has been made in the field. no experimental evidence of this 
symmetry has shown up. 


Part of the reason for the absence of experimental evidence for supersymmetry 
is that we may be very far from the energy levels needed for supersymmetry 
to be recreated. The collider being readied in CERN may be able to throw 
some light on this point. 

The question whether theory should guide experiment or vice versa, has 
been a matter of intense discussion among physicists always. The greatest 
exponents of creating the theory first are of course Einstein and Dirac. Here 
is what the great physicist Y. Nambu calls the Dirac mode of doing physics. 
In particular Nambu points out that supersmmetry is an example of physics in 
the Dirac mode. First, here is Dirac himself, in his introduction to his famous 
paper on magnetic monopoles [Sa]: 


The steady progress of physics requires for its theoretical formulation a mathemat- 
ics that gets continually more advanced. This is only natural and to be expected. . . 
Noneuclidean geometry and noncommutative algebra, which were at one time con- 
sidered to be purely fictions of the mind and pastimes for logical thinkers, have now 
been found to be very necessary for the description of general facts of the physical 
world. It seems likely that this process of increasing abstraction will continue in the 
future and that advance in physics is to be associated with a continual modification 
and generalisation of the axioms at the base of the mathematics rather than with a 
logical development of any one mathematical scheme on a fixed foundation. 


The theoretical worker in the future will therefore have to proceed in a more 
indirect way. The most powerful method of advance that can be suggested at 
present is to employ all the resources of pure mathematics in attempts to pertect 
and generalise the mathematical formalism that forms the existing basis of theo- 
retical physics, and after each success in this direction, to try to interpret the new 
mathematical features in terms of physical entities. . . 


On the other hand, let us hear from Y. Nambu [5b]: 
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The Dirac mode is to invent, so to speak, anew mathematical concept or framework 
first, and then try to find its relevance in the real world, with the expectation that 
(in a distorted paraphrasing of Dirac) a mathematically beautiful idea must have 
been adopted by God. Of course the question of what constitutes a beautiful and 
relevant idea is where physics begins to become an art. 


I think this second mode is unique to physics among the natural sciences, being 
most akin to the mode practiced by the mathematicians. Particle physics, in par- 
ticular, has thrived on the interplay of these two modes. Among examples of this 
second approach, one may cite such concepts as 


Magnetic monopole 
Nonabelian gauge theory 
Supersymmetry 


On rare occasions, these two modes can become one and the same, as in the 
cases of Einstein gravity and the Dirac equation. . . 


Some historical and scientific perspective 


In addition to Salam and Strathdee there is a whole galaxy of physicists 
like Ferrara, Wess, Zumino, and the members of the Russian school such as 
Gol’ fand, Likhtman, Volkov, Akulov, who contributed to the development of 
supersymmetry. On the mathematical side, first of all the late F. A. Berezin 
was one of the pioneers of super analysis and was very influential in the 
development of super geometry and super analysis as independent disciplines 
[6a, 6b]. The reader who wants to get a glimpse of the various historical and 
scientific strands should look at the monumental collection of articles (from 
the physicsts’ point of view) edited by Ferrara [6c]. In addition I wish to 
mention a set of thought-provoking essays by Fronsdal, Flato, Hirai [6d], and 
by Freed [6e]. In addition the introduction of Grassmann algebras as number 
systems to do physics of fermions goes back to Schwinger’s early papers [6f]. 
From the mathematicians’ point of view there are many, starting with Deligne- 
Morgan [7a], Kostant [7b], Leites [7c], Manin [7d]; and more recently, the 
exposition by Varadarajan [8]. For a systematic treatment of super geometry 
as a generalization of both classical differential and algebraic geometry see 
the monograph by Carmeli, Caston, and Fioresi [9a]. For expositions from a 
different point of view see [9d, 9e, 9f]. 
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4.3 The theory of super manifolds 


The precise mathematical formulation of the concept of a super manifold with 
local commutative and Grassmann coordinates can now be given by adapting 
the definition of a classical manifold to the new situation through the notion 
of a ringed space, using the ideas of Grothendieck, as explained above. 


The category of ringed spaces 


If X is aclassical smooth (C™ = infinitely differentiable) manifold, then for 
each open set U C X we have the R-algebra C™(U) of smooth real functions 
on U. The property of smoothness is /ocal and the mathematical concept that 
captures this is that of a sheaf, expressed by saying that the assignment 


Ur C*(U) 


is a sheaf of function rings. The rings C*(U) are often called the local rings. 
If X, Y are two smooth manifolds, a morphism, i.e., asmooth map, from X to 
Y is acontinuous map wy of X to Y such that for each open U C Y, the pull- 
back map y* 1s ahomomorphism of C*(U) into EG (yr-'(Y) ymin this way 
we have the category of smooth manifolds and smooth maps. But in order 
to define supermanifolds where there are Grassmann coordinates locally that 
are numerically unobservable, the above notion of a sheaf of function rings 
has to be generalized significantly, the generalization being achieved in two 
stages. 

The first step of such a generalization is already present in the notion, 
due to Grothendieck, of a ringed space. This is obtained by giving up the 
idea that the local rings consist of numerical functions defined on open sets 
U, but are rather abstract commutative rings, denoted usually by Ox (U). 
Unlike numerical functions the elements of these abstract local rings cannot 
be restricted to open subsets, and so we have to build the restriction maps 
as a part of the data defining a ringed space. Thus, for open sets U, V with 
V CU, we have restriction maps 


Oy(U) > Ox(V), fre fly 


with the property that if W C V Cc U are open, the restriction from U to W 
is the same as restricting first to V and then to W. The /ocal nature of these 
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“functions” is expressed by the requirement that the assignment 
Ut Ox(U) 


is a sheaf of rings. The sheaf axiom, which is the exact encoding of locality, 
is the following: if U is an open set and (U;) is an open covering of U, and 
if f; € Ox(U;) are given, then the following two statements are equivalent: 


(a) there is aunique element f € Ox (U) such that al = f; forall indices 
i 

(b) for any two indices ij, i2, fj, lui,0U = fi, 
Ox(U;, N Ui, )). 


(as elements of 


Ui, NW; 


A ringed space is thus a pair (X, Ox = O) where X is a topological space and 
O is asheaf of commutative rings on X. A morphism of aringed space X into 
a ringed space Y is acontinuous map y : X —> Y together with a pull-back 
map w* of Oy into Oy above y; this means that foreach open U C Y, w* is 
a homomorphism of Oy(U) into Ox(w-'(U)). Notice that unlike the case 
when the rings of the sheaves are rings of functions, the pull-backs w* for any 
open set U C Y have to be specified as part of the data defining morphisms, 
and that these pull-backs have to be compatible with restrictions. One thus 
obtains the category of ringed spaces and their morphisms. 

One may view the open sets in X as a category where the set of morphisms 
from U to V is empty unless U C V, in which case it consists of the nat- 
ural inclusion of U in V. Then a presheaf is a contravariant functor from 
the category of open sets into the category of rings, and the morphisms of 
ringed spaces are natural transformations between these functors above maps 
between the spaces themselves. 

In order to ascend to the concept of a super manifold, we have to take 
the second step of the generalization mentioned above which consists in 
replacing the commutative local rings by supercommutative local rings which 
have a local (not canonical) splitting as an algebra over commuting and anti- 
commuting local coordinates. Supercommutativity means that the local rings 
are Z)-graded, and commutative in the super sense, that is, instead of the 
relation ab = ba between arbitrary elements of the ring, we will have ab = 
tba where the sign is always plus unless both a and b are odd, in which case 
the minus sign ts taken. Thus a super ringed space is a pair (X. Ox) where 
Oy isasheaf of super commutative rings on the topological space X. A super 
manifold is then a super ringed space with the property that X is a classical 
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smooth manifold, and that locally the sheaf looks like the sheaf of rings on 
R’” of the form 


OMG (0) Fig eye) — Co (U):@ Aléy,.5. 8] 


where the &; are Grassmann variables satisfying 


=O) Ss, +5 —0 Gj =1,2....,4) 


and A[é,..., &,] is the exterior algebra over R in g indeterminates. The 
reader who is familiar with quantum field theory will recognize in the re- 
quirement of super commutativity for the local rings the idea that to make 
the transition from a description of classical fields to fermi fields one should 
replace the classical commutators by anticommutators. An equivalent way 
of conceiving the idea of a super manifold is to start with the superdomains 
which are the ringed spaces R’'” = (R’, O”'4) where O”'! is the sheaf 
CC (Om... . . &,]; we then define a super manifold of dimen- 
sion piq as the ringed space obtained by gluing together the R”"”. 

It is clear that one can formulate the notion of a real, complex, or p-adic 
analytic, super manifold by replacing the underlying classical manifold by a 
real, complex, or p-adic analytic one. The natural end of this line of thought 
is the concept of a super scheme. 


The category of super vector spaces and super algebras 


As a basis for the study of super manifolds we have the theory of super vector 
spaces. If A is a ring and we have a direct sum decomposition A = @Aje; 
where the A; are additive subgroups, we say that A is graded by I if I itself 
is an abelian group and A;A; C Aj+; fori, j € 7. The A; are sometimes 
referred to as the homogeneous components of A. The most common index 
group / is Z, the group of integers. If V 1s a vector space and S(V), A(V) are 
the symmetric and exterior algebras of V, then S(V) and A(V) are graded 
in a natural manner by Z. In supergeometry, grading by Z. = Z/2Z is 
fundamental; if A is Z.-graded, the nonzero elements of Ao are called even 
and those of A, are called odd. 

The infinitesimal structure of a supermanifold at a point is captured by the 
properties of its tangent spaces, which are objects of the category of super 
vector spaces, namely the category of Z2-graded vector spaces. This is a 
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category with direct sum @, dual *, and tensor product ®, and is a prime 
example of a tensor category where the objects are abstract and the category 
has the three operations above with suitable functorial properties that imitate 
those of the vector space category. In particular we have isomorphisms cyy, 


cyyv:U@V— VEU 


which (through various requirements such as the hexagon axiom) capture the 
essential properties of the isomorphisms in the super vector space category. 
There the isomorphisms are given by 


Coy 2h vt (—-1)°@" ,@u Ue Ul ve V), 


Here p(x) is the parity of x and is either 0 or 1. These structural assumptions 
explain all the characteristic features of the practical use of super vector 
spaces: sign rule, definition of supercommutativity, super Lie algebras, and 
so on. Moreover they “hide” all the signs in the various definitions. In what 
follows we shall discuss a few of these. The reader should note that almost 
everything makes sense for abstract tensor categories and this more general 
interpretation gives great insight [7a]. Finally the field over which all vector 
spaces, and more generally, the tensor category, is defined, is assumed to be 
of characteristic 0; in applications it is either R or C. 

For instance an algebra in the super category is an object A with a map 
jt: A@A —> A. Itiscommutative (which means supercommutative) if u = 
{£0C,,. For super vector spaces this means the relationab = (—1)?? ba 
between any two homogeneous elements a. b of A, a first example of the sign 
tule: in any classical formula, whenever two odd elements are interchanged, 
there should appear a minus sign. The cuy can be generalized to an action 
of the symmetric group Gy in N letters on V @ V @---@ V (N factors). A 
Lie algebra in the tensor category is an algebra object g with multiplication 
denoted by [-, -], having the properties: 


[ee gee 0 


[-, [-, ‘Wa 1.0 oy =) 


where o is the automorphism of g @ g @ g induced by the permutation 
(123) —> (312). If we unscramble these definitions we obtain the usual 
working definition of a super Lie algebra, as we shall see presently. 
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Historically, super Lie algebras first came up in the work of the physi- 
cists when the first supersymmetric infinitesimal transformations were writ- 
ten down. Gol’ fand-Likhtman [10] and Volkov-Akulov [11] discovered the 
minimal SUSY extension of the Poincaré Lie algebra in the early 1970's. 
Wess-Zumino [12] discovered a little later, in 1974, the first example of a 
simple super Lie algebra, namely the minimal SUSY extension of the confor- 
mal Lie algebra. In 1975 V. Kac [13] formally defined super Lie algebras and 
carried out the super version of the Cartan-Killing classification of simple Lie 
algebras over C. 


Super Lie algebras 


A Lie super algebra or super Lie algebra is a super vector space g with 
a bracket | . | which is a morphism from g @ g to g with the following 
properties: 


(a) [a, b] = —(—1)?P[b, a]. 
(b) The (super) Jacobi identity (unscrambling the earlier version in a tensor 
category) 


la. [b. Cee yeep, [c. Teal) ia ail Ve fa, bj] =): 


This is the working man’s definition of the super Lie algebra. One can hide 
the signs above by rewriting these relations as we did previously, using Cgg 
and the action of G3 on g ®@ g @ g, showing that this is the same as saying 
that a super Lie algebra is simply a Lie algebra in the tensor category of super 
vector spaces. In fact in [7a] there is a beautiful treatment of Lie algebras in 
any k-linear tensor category where k 1s a field of characteristic 0. Clearly (b) 
reduces to the ordinary Jacobi identity for ordinary Lie algebras. 

There is a second way to comprehend the notion of a super Lie algebra 
which is more practical. The bracket is skew symmetric if one of the elements 
is even and symmetric if both are odd. The super Jacobi identity has 8 special 
cases depending on the parities of the three elements a, b,c. By examining 
these we can conclude that a super Lie algebra 1s a super vector space g on 
which a bilinear bracket [ , ] is defined such that 


(a) go is an ordinary Lie algebra for [ , ]. 
(b) g) is a go-module for the action a -— ad(a) : bt > [a. b] (b € g)). 
(c)a ®@ b += [a. b] is asymmetric go-module map from g) @ gi to go- 
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(d) For all a € g;, we have [a, [a, a]] = 0. 


Except for (d) the other conditions are linear and can be understood within the 
framework of ordinary Lie algebras and their representations. The condition 
(d) is nonlinear and is the most difficult to verify in applications when Lie 
super algebras are constructed by putting together an ordinary Lie algebra 
and a module for it satisfying (a)-(C). 

If A is a super algebra, we define 


(a, b] = ab — (—-1)? ©? ba (a,b € A). 


It is then an easy verification that [ . ] converts A into a super Lie algebra. 
It is denoted by A, but often we omit the suffix L. If V is a super vector 
space, we write End(V) for the space of all linear maps V —~ V, regarded 
as a Super vector space with the maps preserving the grading as even and the 
ones reversing the grading as odd. This is a super algebra and we often write 
gl(V) for the corresponding Lie algebra; if V = R?'? we write gl(p|q) for 
gl(V). 
Let g be a super Lie algebra and for X € g let us define 


ad X :g —> g, OD. 0 Ae—|D.G 2) 


Then 
ad: X +> ad X 


is a morphism of g into gl(g). The super Jacobi identity is just the relation 
lad X..ad Y] = ad [X, Y] (X,Y €g). 


The development of super linear algebra including the theory of modules 
over super commutative rings has no real surprises till we come to the gener- 
alization of the determinant, the Berezinian. Let R be a super commutative 
algebra over a field k of characteristic 0 and R”” be the free module of dimen- 
sion p|q over R. Let GL(p|q)(R) be the group of invertible even morphisms 
of R’'!, Then the Berezinian is a morphism of GL(p|q)(R) into Ro (the 
group of units of the even part Ro of R) given by 


Ber(x) = det(A — BD~'C) det(D)~! 


where 
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We have 
Ber(xy) = Ber(x)Ber(y). 


This is the superversion of the determinant, discovered by F. A. Berezin. 
Since the entries of B and C are nilpotent, x is invertible if and only if A 
and D, whose entries are in the commutative ring Ro, are invertible, or even 
only invertible mod R,. The usual determinant in the commutative category 
can be interpreted as the result of an action on the top part of the exterior 
algebra; for the super category the generalization of such an interpretation is 
homological and lies at a deeper level [7a]. 


Functor of points 


In calculations with super manifolds physicists work with coordinates and 
manipulate the odd variables more or less on the same footing as the classical 
commuting coordinates. The functor of points approach is essentially a way 
to make sense of such calculations. 

The philosophical-mathematical principle behind the functor of points ap- 
proach to geometry that is revealed in Grothendieck’s work is that an object 
can be comprehended by its relations with other objects [3b]. Put in this 
manner this seems to be a sort of well-meaning idea without substance. But 
translated into categorical terms and applied systematically it becomes ex- 
tremely powerful. If C is a category and A is an object of C, then the principle 
says that A is completely determined by the functor 


Fa: T t+— F(T) = A(T) = Hom(7, A) 


of C into sets, and that for two objects A,.A2 of C, the set Hom(A), Az) 
is in natural bijection with the set of natural transformations between the 
functors F,4, and F4, (Yoneda’s lemma). The elements of F'4(7) are called 
the T-points of A. 

This notion, applied to the category of super manifolds, leads to a concept 
of of points of a super manifold that is the true geometric one. Let X be 
a super manifold. For any super manifold T we write X(T) for the set of 
maps T —> X and view the elements of X(7) as the T-points of X. If 
T is a single point viewed as a purely even manifold, X(7) is the set of 
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topological points of X. X(T) is a contravariant functor in T and for any 
super manifold Y, the set Hom(X, Y) can be recovered precisely as the set 
of natural transformations X(T) —> Y(T) (Yoneda’s lemma). One can use 
this to define the products of super manifolds: the product X; x --- x Xy is 
the super manifold whose T -points are X|(T) x --- x Xy(T). In this context 
we introduce the usual definition of when a set-valued contravariant functor 
T t+ > f(T) is representable, namely when there is a super manifold M such 
that 7 (7 =f (7): 


Morphisms 
Because the loeal rings € = (7)... ....., 2 x,)(01....6,] are not algebraically gen- 
erated by the x,...,x,, 6),...0,, one has to bevcareful in-definingeaner- 


phisms. Actually the situation is exactly as in the algebraic case, and this is the 
reason why the theory of super manifolds can be developed for the most part as 
the theory of schemes. If M, N are super manifolds and x)....x),0)....9%,% 
are coordinates on N, to define a morphism from M to N it is sufficient to 
specify the images of the x;, 6;. To illustrate this I give a characteristic exam- 
ple of a supersymmetic transformation that mixes the odd and even variables. 
The manifold is R'' with coordinates t, 6), 6. The transformation is then 
specified by ¢ = 1 -0162,0, = 0),0, =). Ie eC (U avliete GR 
is open, the image of g is formally g(t + 6)62); if g is a polynomial in ¢ this 
is obviously well-defined; if g is smooth but not a polynomial, we give a 
meaning to g(t + 6,62) by expanding it as a Taylor series. The Taylor series 
terminates since (6,6>)* = 0. Thus 


g(t + 0,62) = g(t) + g'(t)014>. 


This method is perfectly general. 


Differential calculus 


Because of the above result on morphisms, one has a theory of differentiation 
on super manifolds that is completely analogous to the theory on classical 
manifolds. The differential criteria for a map to be a local diffeomorphism, 
the structure of submanifolds, etc remain essentially the same. Super subman- 
ifolds of super manifolds are defined by local equations whose differentials 
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have maximal rank at each point. For instance if 


Pisces fro Bis oes 8s 


are r even and s odd sections of R’”" such that the matrices 


vi) Op ) 
ox] axd/ 


have respective ranks r, s at 0, then the equations 


fis =fe=g=--=g,=0 


define a sub super manifold locally at 0 of dimension (p — r)|(g — 5s). 


Integral calculus 


Let 
g' =o'g2...o% T= (i,), fi, < ig <--- <i). 
On 
le ee 
the integral is a linear map 
at [eave 


defined by 


[otane a I iflll<q 
1 i fl=Q=/{1,2....,q}. 


Integration is also differentiation: 


| = (soz) (sees) ~ (zar) 


In the local ring with coordinates x', 6/, 


[ sarxaro = [ soa"s (Co D510). 
/ 


le 
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The change of variables formula 


For a morphism given locally as 
yy : (x, 0) > (y, 9) 


we define the Jacobian matrix 


Then 


fs = fv orpery) 


for compactly supported sections of the local ring. For arbitrary manifolds 
we use partitions of unity as in the classical case. This beautiful formula goes 
back to Berezin. The justification for the peculiar definition of integration in 
the anticommuting variables is the change of variables formula. 


The global sections functor 


To any C™ super manifold M we have the super algebra of global sections 
Oy(M) = A(M). Itis possible to show that M can be recovered from A(M). 
More precisely, 

M+— A(M) 


is a functor from the category of smooth super manifolds to the category of 
super algebras which is fully faithful’’. 


Heuristic conception of a super manifold 


The intuitive picture of M is that of a classical smooth manifold |M| sur- 
rounded by a Grassmannian cloud [6e, 8]. The cloud cannot be seen: in any 
measurement the odd variables will be 0 because they are nilpotent. Thus 
measurement sees only the underlying classical manifold |M|. Nevertheless 
the presence of the cloud eventually has consequences that are striking. 

As an illustration of how we use the idea of functor of points even in 
elementary contexts almost without thinking, consider the polynomial ring 
Fit] where F isa field and? is an indeterminate. One can perform calculations 
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in this ring purely abstractly, but somehow when we first encounter it the 
elements of the ring appear elusive. But suppose now we take a ring G 
containing F’; any polynomial f € F[t] can now be evaluated at a point of 
G and so f gives rise to a function from G to G. By varying G we can get 
a complete hold on f through very concrete numerical functions. Suppose 
now we consider the superdomain R'!'. The abstract way 1s to operate on the 
sheaf with elements of the form f + 6g where @ is the Grassmann variable 
and f, g are smooth functions of the real coordinate r. But if T is any super 
manifold and we consider a map of T into R'!', the pull-back of such a map 
will map ¢ into an even section on T and @ into an odd section on T. We may 
thus interpret ¢ and 6 as even and odd sections of T so that R!!'(T) becomes 
identified with the set of all (t, 6). This method of working with the functor 
of points allows us to calculate in a way that is very close to the physicists’ 
way of using @ as a Grassmann indeterminate. 

For a super manifold A, the functor T +> A(T) is huge in the sense 
that its domain of definition is the category of all super manifolds. For 
practical purposes one may want functors with more limited domains that 
may still allow us to recover A. One candidate 1s the category € of all finite 
dimensional exterior algebras over R. If A is one such we can think of A 
as the structure sheaf of a super manifold whose classical reduction is just 
a single point; we may call this a super point denoted by p,. Thus we can 
restrict the functor of points to the category of super points and obtain the 
functor 


E,: At Aq = Hom(pa, A) © Hom(O,(A), A) 


from the category € to the category of sets. However, because the domain of 
this functor is very small, it is not possible to recapture the super manifold A 
from the functor E, as long as we think of it as a functor whose values are just 
sets. But it turns out that the sets A, acquire the structure of a Ag-modules 
(Ao is the even part of A), and Hom(A;, A2) is in natural bijection with the 
natural transformations between the functors E,, and E,4, preserving this 
manifold structure. This follows from the work of Schwarz and Voronov 
[9b]. 

Berezin’s work already had the idea of considering only the exterior alge- 
bra. Actually this idea goes back to A. Weil in classical differential geometry. 
In a beautiful paper (Théorie des points proches sur les vartétes différen- 
tiable, Colloque de Géometrie Differentielle(Strasbourg 1953), C. N. Rus. 
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111-117; also in Andre Weil:Oeuvres Scientifiques:Collected Papers, Vol I, 
103-109, Springer 1979). Weil introduces commutative local algebras of fi- 
nite dimension A and considers the A-points of a smooth manifold to be the 
maps from the smooth functions on the manifold to A. This can be extended 
to the super context by replacing these algebras by supercommutative local 
algebras and has been done in [9c]. The corresponding functors are called 
the Weil-Berezin functors. 

Representability of the various functors is an important issue. But even 
when these functors are not representable, there is some advantage in thinking 
of them as virtual manifolds and the natural maps between them as virtual 


morphisms. 


4.4 Super Lie groups 


Super Lie groups act on super manifolds and so provide the symmetries of 
these new objects. They can be defined categorically as group objects in the 
category of super manifolds. Thus, to say that G is a super Lie group, is to 
say that G is first of all a super manifold, and that we have morphisms 


fu: Gx G— G(multiplication) «:G—-> G (inverse) 


that have the associativity properties usually ascribed to multiplication and 
inverse, together with the identity element 


e: {point} —> G. 


From the perspective of functor of points the functor G(T) takes values in 
groups, so that a super Lie group may be defined as a contravariant functor 
from the category of super manifolds to the category of groups, which is 
representable. The theory of super Lie groups may be built up in the same 
way as the theory of classical Lie groups, but some of the technical aspects 
become more involved because things have to be done using the functor of 
points. 

One can define on any super Lie group the notion of left (or right) invariant 
vector fields, and hence the notion of a Lie algebra of a super Lie group. This 
is a super Lie algebra and the correspondence between G and its super Lie 
algebra g has essentially all the features of the classical correspondence. 

The simplest super Lie group of some generality is GL(p|q). I have nor 
specified the ground field which can be R, C or a nonarchimedean local field. 
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Let us assume for simplicity that it is R. We can think of this as the set of 


P\q block matrices 
AB 
CD 


where the entries of A and D are even variables, and the entries of B and C 
are odd Grassmann variables, with A and D invertible. This means that the 
underlying classical Lie group is Gp = GL(p) x GL(q) with coordinates as 
the entries of the matrices A and D, and the sheaf is 


OG,[B.C]. 


If T is a super manifold and we interpret the entries of A(T) and D(T) as 
even sections on T and those of B(7T) and C(7) as odd sections, then the 
functor of points is 

T +> G(T) 


where G(T) is the group of elements of the matrix 


ee a) 
C(T) D(T) 


with A(T), D(T) invertible. 
Other super Lie groups are, as usual, defined as sub super Lie groups of 
GL(p|q). SL(p|q): This may be defined as the kernel of the morphism 


Ber : GL(p|g) —> GL(1|0) 


where Ber is the Berezinian, or, equivalently, as the subgroup defined by the 
equation 
Ber= |. 
Similarly, but in a more involved manner, we can define the orthosymplectic 
groups OS(p|2q). See [8], pp. 290-294. 
It is possible to discuss these groups as examples of super algebraic groups. 
For a construction of the Chevalley super groups see [14]. 


Super Harish-Chandra pairs 


Of great importance for many applications is a theorem which allows one to 
describe a super Lie group in more concrete terms. Given a super Lie group 
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G we have a pair (Go, g) where Go is the classical Lie group underlying G, 
and g is the super Lie algebra of G. We have 


(1) Lie(Go) = go. . 
(2) Go has an action on g and its differential is the adjoint action of go on 


g. 
Such pairs (Go, g) can be also introduced a priori without reference to a 


super Lie group. They are then called super Harish-Chandra pairs. The 
basic theorem, which is fundamental for us, is the result that 


Gre (Go, g) 


is a covariant functor which is an equivalence of categories from the category 
of super Lie groups to the category of super Harish-Chandra pairs. In terms of 
super Harish-Chandra pairs one can describe actions of super Lie groups on 
super manifolds. This allows a satisfactory treatment of super homogeneous 
spaces X = G/H where G is a super Lie group and H is aclosed super Lie 
subgroup. The functor of points of X is not the functor T +—~ G(T)/H(T) 
but rather the sheafification of it [15, 16a, 16b]. For a concise and beautiful 
presentation of the basic theory of super Lie groups, see [17]. 
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Chapter 5 
Unitary representations of super Lie groups* 


There is still no direct evidence that supersymmetry is a 
symmetry of the physical world, that elementary particles 
arrange themselves in supermultiplets of spin differing by 

half a unit. It must be broken, since if unbroken it would 
predict that the particles in a supermultiplet have the 
same mass. Finding the correct breaking mechanism is 
probably still the basic unsolved problem. 

Bruno Zumino 


The concept of a unitary representation(UR) of a super Lie group is formulated via super 
Harish-Chandra pairs. For super semidirect products the classical Wigner-Mackey theory 
of little groups works perfectly in the supersymmetric setting, and leads to a description 
of all of their unitary irreducible representations (UIR). The Clifford structure of the 
representations and the concept of super multiplets all make sense in the general context 
of super semidirect products, which includes all cases studied by the physicists, and leads 
to many of their major predictions: multiplet structure (both for minimal and extended 


supersymmetry), and the famous susy partners. 


5.1. Unitary representations of a super Lie group 

5.2. Super imprimitivity theorem for even homogeneous spaces 

5.3. Unitary irreducible representations of super semidirect products 
5.4. Super Poincaré Lie algebras and Lie groups 

5.5. Unitary representations of super Poincaré groups 

5.6. Super particles and their multiplet structure 


5.1 Unitary representations of a super Lie group 


In quantum physics symmetries are expressed in terms of unitary operators. 
If P(H) is the projective space of the Hilbert space H of quantum states, and 
we think of a symmetry as a bijection of P(H) that preserves the transition 


” This essay is partly based on the lectures I gave at a conference on supersymmetry 
held in Oporto, Portugal, in 2006. I am grateful to the organizers of that conference for 
their splendid hospitality. I also wish to thank Claudio Carmeli, Gianni Cassinelh, Dan 
Freed, Marian Lledo, and Albert Schwarz for several stimulating conversations during 
the conference. See also [1a, 1b]. 
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probabilities |(@, y) |?, then it is induced by a unitary or anti-unitary operator 
of H, unique upto a scalar of absolute value 1 (phase factor): if L is the 
operator, [wv] H—> [Ly] is the induced symmetry, {w] being the point in 
P(H) corresponding to the vector y. If we want symmetry with respect to a 
topological group G, then on a closed normal subgroup Go of G of index at 
most 2 all symmetries are unitary; in particular, G = Go if G is connected. 
The phase factors may present problems and one may have to go to a central 
extension of the group G to get unitary representations without any phase 
factors. In any case symmetry is ultimately expressed via the notion of a 
unitary representation. 

To formulate the notion of unitary representations in the supersymmetric 
context it is natural to proceed first infinitesimally. The study of unitary 
representations of super Lie algebras began not long after super Lie algebras 
were discovered. The physicists, proceeding infinitesimally, unveiled a rather 
complete picture of the elementary particles and their states in the super world, 
based on the heuristic assumption that the transition to the super world would 
not affect the main outlines of the techniques (little groups) used to determine 
the structure of elementary particles in the usual setting. 

However unitary representations of super Lie groups are a different story 
because of the subtle manner in which a super Lie group differs from an 
ordinary Lie group. A super Lie group is an object which is not a group 
but a group-valued functor, and it is not immediately clear how to mesh 
the functorial approach with the functional analytic aspects that enter when 
dealing with representations that may be infinite dimensional. Extra care is 
therefore needed to define a unitary representation of a super Lie group in 
a possibly infinite dimensional super Hilbert space. The study of unitary 
representations of a super Lie group therefore lagged far behind that of the 
representations of super Lie algebras. 

In this section I shall discuss how to make the correct definition for a unitary 
representation of a super Lie group. It seems likely that further developments 
of the representation theory of super Lie groups may require a theory of 
representations of super Lie groups in Banach and even Frechet spaces, but 
we shall not take up these generalizations here. It is however very important 
to be completely general and not restrict the class of super Lie groups in any 
way, in defining the fundamental concepts. 

Our point of departure is the fact that a super Lie group may be identified 
with a super Harish-Chandra pair (Go. g). This is reasonable, since as I have 
mentioned in the previous essay, the category of super Lie groups is naturally 
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equivalent to the category of super Harish-Chandra pairs. Let me recall that 
(Go, g) is a super Harish-Chandra pair if Go is a classical Lie group and g 1s 
a super Lie algebra with an action of Go on it such that 


(i) Lie(Go) = go=the even part of g. 
(ii) The action of Go on g is the adjoint action of Gp; more precisely, the 
adjoint action of go on g is the differential of the action of Go on g. 


Thus we need not make any distinction between super Lie groups and super 
Harish-Chandra pairs. If we carry over this principle to representations we 
are led to the idea that a representation of a super Lie group (Go. g) may be 
thought of as a triple (7. y, H) where z is an even representation of Go in 
a super Banach (or Frechet) space H (this means that 7(g) is even for all 
g € Go), and y is a super representation of g in H compatible with 2. We 
shall only be interested in the case of unitary representations. This means 
that H is a super Hilbert space, zo is unitary, and y satisfies an appropriate 
condition that reflects the unitarity of the representation at the infinitesimal 
level. It turns out that a natural condition would be to require that y(X) be 
skew self adjoint in the super sense for all X € g. There is in addition an 
important point, technical, but critical nevertheless. To see this, we note that 
for X € g), [X. X] € go and y([X. X]) = 2y(X)*. Now, the operators 
corresponding to the elements of go are typically unbounded, and so the 
operators y(X)(X € g)) will in general be unbounded; this means that there 
are domain considerations to be taken care of. In the next paragraph I shall 
address both of these points. 


The basic definitions 


All sesquilinear forms are linear in the first argument and conjugate linear in 
the second (opposite to the physicists’ convention). A super Hilbert space 
is a Zj-graded Hilbert space H where the H,(i = 0, 1) are closed mutually 
orthogonal subspaces. If (x, y) is defined as i”? (x, y), then (x, y) is an 
even super Hermitian form. For bounded operators T : H —> H the adjoint 
with respect to the super form is denoted by T’ and is related to the usual 
Hilbert space adjoint by T° = T* or —iT* according as T is even or odd. 
We define T° for unbounded T by this rule in terms of T”. 

In order to motivate the definition of unitary representations of super Lie 
groups we begin by observing that in the even case of an ordinary Lie group, 
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the unitarity of a representation 7 of the group is usually expressed at the 
infinitesimal level as follows: if dz is the representation of the Lie algebra, 
then for all X in the Lie algebra the operators d7(X) are skew Hermitian. If 
we carry this over to the super setting the condition should be that the dm (X) 
should be skew super Hermitian for all X € g. This means that for X € go 
we should require that dz (X) should be skew Hermitian while for X € g; 
we should have d(X)* = —idz(X), ignoring domain considerations. Let 


G = ents 
Then this last condition can be written as the condition that 
PCO] Caz), (eg) 


be Hermitian symmetric. Thus at the formal level we define a unitary rep- 
resentation of (Go, g) to be a triple (7, y, H) where z is an even unitary 
representation in a super Hilbert space H, y is a representation of g that is 
compatible with the action of Gg ong, with y = dm ongg and p(X) = fy (X) 
Hermitian symmetric for all X € g;. In other words, unitary representations 
are triples (71, 9, H) with the appropriate compatibility and hermitian sym- 
metry conditions described above. 

To make this definition precise we have to bring in the domains of the 
operators p(X) for X € gj. It is natural to start with the assumption that they 
should have a common dense domain. Initially we shall suppose that this 
domain is the space of all infinitely differentiable vectors for mo, but this will 
be relaxed later and we shall see that even if one starts with a different domain, 
the compatibility and symmetry conditions force the operators p(X)(X € g)) 
to be well defined on the space of all infinitely differentiable vectors for m0. 

We therefore make the following definition. A unitary representation of a 
super Lie group (Go, g) is a triple (7, p, 1) with the following properties: 


(i) 7 is an even unitary representation of Go in the super Hilbert space H. 
(il) p is a linear map of g) into the subspace of odd endomorphisms of 
C™(x). Here C* (sr) is the space of infinitely differentiable vectors for x 
(it is super linear, i-e., it contains the odd and even components of each of 
its elements). 

(iii) p satisfies the requirements below: 


(a) p(80X) = (80) p(X) (go)! (X € gi, go € Go) (compatibility of 
p with zr). 
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(b) p(X) with domain C™ (zr) is symmetric for all X € g,. This means 
that the adjoint p(X)* is an extension of p(X) for X € 1. 
(c) ~idm([X, Y]) = p(X) p(Y) + p(Y)p(X)(X, Y € gi) on C® (72). 


Theorem 1. /f (zr, p, H) is a unitary representation of (Go, g), then for any 
X € gy the operator p(X) with domain C™(z0) is essentially self adjoint, and 


dm : Xo +X, + > da(Xo) +f | p(X1)(Xo € go, X1 € g1) 


is a super representation of the super Lie algebra g in C™ (st), compatible 
with mr. 


The choice of C(t) as the domain for the operators o(X), while natural, 
might seem arbitrary. For instance one might use the analytic vectors instead 
of the differentiable ones. It turns out that this objection is illusory, i.e., 
any alternative system leads always to a unitary representation in the above 
sense. This makes it clear that in spite of appearances our definition of a 
unitary representation of a super Lie group is a very viable and stable notion. 
To make these remarks more precise let us consider a system (z, p, B, H) 
with the following properties. 


(i) B is a dense super linear subspace of H invariant under z, mz being an 
even unitary representation of Go in the super Hilbert space . Moreover 
B c D(dx(Z)) for all Z € [g), 9:], D(A) denoting the domain of the 
operator A. Here for any Z € go, we write —idm(Z) for the unique self 
adjoint operator in H. such that 2(exptZ) = exptdm(Z)) for allt € R. 
(ii) Each p(X) for X € gj, is a linear operator in H with D(p(X)) D B 
such that X +> p(X) is a linear map of g; into Hom(B, 1). Moreover, 


(a) 0(X) is symmetric for all X € g). 
(b) 0(X)B; C Hi+;(mod 2) for all X € gi. 
(c) p is compatible with 7, i.e., 


m(g)p(X)(g) | = p(gX) 


on G for g € Go, X € gi. 
(d) p(X)B C D(p(Y)) for all X, ¥ € g) and 


—idn([X, Y}) = p(X)p(Y) + p(Y) p(s) 


for all X, Y € g; on B. 
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Theorem 2. Let (21, p. B, H) be as above. Then for any X € Qi, p(X) is 
essentially self adjoint on B; and C™ (i) C D(p(X)). Let us write p(X) for 
the restriction of p(X) to C™ (st). Then (10. p. H) is a unitary representation 
of (Go, g). If (a. p’, H) is a unitary representation of (Go. g) such that 
BC D(p'(X)) and p'(X) restricts to p(X) on B for all X € gy, then ip =p. 


Remark. For operators A, B in H write A < B if D(A) C D(B) and B 
coincides with A on D(A). Let A be a densely defined symmetric operator 
which is essentially self adjoint. Then A, the closure of A (which exists 
because A is symmetric), is the unique self adjoint extension of A; moreover, 
if B is any symmetric operator such that A < B, we have A = B so that 
A < B < A = B. Thus, as a consequence of the first statement in Theorem 
2 we see that extension by closure of p(X) is self adjoint and its domain 
contains C™(z:). This is the key fact in the theorems. 


In the classical theory of representations of semi simple or reductive Lie 
groups the K -finite vectors play a decisive role. The space of K -finite vectors, 
although in general a subspace of the space of analytic vectors, is not invariant 
under the group but only under the Lie algebra. So it would appear useful to 
have a variant of Theorem 2 where B is a subspace of C“(z), the space of 
analytic vectors of 2, with B stable under g rather than Go. We then have the 
following theorem. 


Theorem 3. (7) /f (7. p. H) is a unitary representation of (Go. g), then the 
p(X) for X € g; map C°(x) into itself, and dx as in Theorem | is a rep- 
resentation of g in C°(zr). (ii) Let Go be connected, let B be a dense super 
linear subspace of C°(x), and let da be a representation of g in B such 
that dn(Z) ~« dxo(Z) for all Z € gy and p(X) = Cm(X) symmetric for 
X € g). Then for any X € gy, p(X) is essentially self adjoint on B and 
C™°(x) C D(p(X)). Let us write P(X) for the restriction of p(X) te C™ Gr). 
Then (1, —. H) ts a unitary representation of (Go. g). Finally, if (2, p'. H) 
is a unitary representation of (Go. g) such that BC D(p'(X)) and p'(X) 
restricts to p(X) on B for all X € g, then p' = P. 


With these theorems we have realized our objective, namely to introduce a 
viable category of unitary representations of a super Lie group. A morphism 
from Tl = (z. p. H) to M1’ = (2’, p’. H’) is a bounded linear map A(H —> 
H’) that intertwines zr, 7’ and p, p’. Note that as soon as A intertwines 7 
and zr’, it maps C™ (zr) into C™(zr'), and so the statement that A intertwine 
p, p makes sense. If A is an isomorphism and unitary we speak of unitary 
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equivalence. IT’ is a subrepresentation of 1 if H’ is aclosed graded subspace 
of H invariant under 7, H’ M C™(s) is invariant under p, and zr’ (resp. p’) 
is the restriction of 2 (resp. p) to H’ (resp. HM C™(z)). If Misa proper 
nonzero subrepresentation of 11, H” = H’", and 2" is the restriction of z to 
H", then C™ (sr) is the direct sum od C™ (zt) and C™ (sr), and it follows from 
the essential self adjointness of o(X)(X € g)) that C’(2”) is stable under 
all the p(X); if p” is the restriction of p to C\(2"), then 11” := (2, p”. H”) 
is a unitary representation of (Go. g), and NM = M1’ @N”. 17 is irreducible 
if the only subrepresentation of IT that is non zero is [1 itself. We have the 
following theorem. 


Theorem 4. (7) [ is irreducible ifand only ifHom(1], 11) = C. (ii) Let N1 be 
aunitary representation of (Go. g) and let P* be the spectral measure of p(X) 
for X €q. IfH' is aclosed super linear subspace of H', then the following 
statements are equivalent: (a) H’ is stable under nm and H' \C™ (zr) is stable 
under all p(X)(X € g,) (b) His stable under x and all the projections P*, 
F Borel and C R. In particular, 1) is irreducible if and only if there is 


no nonzero proper closed super linear subspace invariant under 1 and all 
x 
PX(X € gh). 


Some comments on the proofs 


The most striking aspects of the above theorems are the essential self ad- 
jointness of the p(X)(X € gj) and the fact that C° (7) C D(p(X)). It is 
clearly because of these facts that no matter what subspace B we start from, 
all the operators o(X) extend naturally to C™ (zo) and we obtain a unitary 
representation in the precise sense we have defined. I shall try to explain the 
idea behind proving these facts. 

First let us introduce some definitions. If A is a closable operator and 
D c D(A) a dense linear subspace, we say that D is a core for A if A 
is the closure of its restriction to D. Let A be symmetric. We say that a 
vector yw € D(A) is analytic for A if w € D(A") for all n and the series 
>, tA" wl|/n! converges for some t > 0. It is a well known result that if 
A Cc D(A) contains a dense set of vectors analytic for A then A is essentially 
self adjoint and A is a core for A. 

In SUSY quantum mechanics the key assumption is that the Hamiltonian 
H = L?, L being an odd symmetric operator. It is usually suggested that 
this assumption implies the positivity of the energy operator H. Actually this 
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condition has deep consequences, not just to the positivity of the energy but 
to our entire theory. Let us consider the context of Theorem 2. Then, for 


X € g), we have on B, 
20(X)* = p([X, X]) = —idm([X, X]). 


Now . 
U, = m(expt[X, X]) = e''* 


where K is self adjoint by Stone’s theorem. Under our hypotheses 
Kk — 25005 


on B. The key lemma in the proofs of the theorems stated above is the 
following. 


Lemma. Let H be a self adjoint operator and U, = e''". Let A C D(H) be 
a dense linear subspace. Assume that A satisfies one of the two conditions: 
(a) A is invariant under the one-parameter group (U,) (b) A contains a dense 
set of vectors analytic for H. We then have the following. 


(i) Ais acore for H. 

(ii) Let L be asymmetric operator with A C D(L) such that LA C D(L) 
(so that A C D(L?)) and L? coincides with H on A. Then L is essentially 
self adjoint and A is a core for it. Moreover, H = jaar particular, 
H 20, DG) eDp@). 


Remark. This lemma not only gives the precise conditions under which we 
can say that a SUSY Hamiltonian is positive, but also shows how the “odd 
square root” of the Hamiltonian is itself controlled by the Hamiltonian. 


Additional remarks 


The basic reference for all of this is [la]. But, already in 1986, Hirai and 
Fronsdal had explored the idea of unitary representations of general super- 
groups in [2]. However their treatment is only formal; there is no discussion 
of domain considerations for the odd operators, which is a crucial part of the 
treatment in [la]. Nevertheless the idea of working with pairs (Go, g) can be 
found in [2]. Part of the reason for the absence of a more precise treatment 
in [2] must have been the fact that there was no viable definition of super Lie 
groups available at that time. On the other hand, the physicists’ treatment 
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goes back much earlier to the early 1970's, almost immediately after Salam 
and Starthdee got the idea of super manifolds (see references [4a, 4b] of the 
previous essay), but was entirely infinitesimal and confined only to the super 
Poincaré algebras. I shall make remarks on their results later. 


Positive quadratic structures on g, 


The self adjointness of the p(X) implies that o(X)? is a nonnegative self 
adjoint operators for all X € g;. Thus we have a positive quadratic structure 
on g;, and hence a positive Clifford structure. Thus one of the basic principles 
of the theory of unitary representations of super Lie groups already emerges 
at this level, namely, it is a combination of unitary representation theory 
of ordinary Lie groups with the representation theory of positive Clifford 
algebras. 


5.2 Super imprimitivity theorem for even homogeneous spaces 


For ordinary Lie groups the central results of representations of semi direct 
product Lie groups follow as an application of the imprimitivity theorem 
which establishes a functorial equivalence between the category of unitary 
representations of a closed subgroup Ho of the Lie group Go and systems of 
imprimitivity for Go based on Q = Go/ Ho. Actually the imprimitivity theo- 
rem for finite groups goes back to Frobenius. In 1939, Wigner [3] developed 
an extension of the Frobenius theory for the Poincaré group, and then, in the 
late 1940s and early 1950s, Mackey proved a far-reaching generalization of 
the Frobenius theory, valid in the enormously much wider category of locally 
compact second countable groups; see [4]. Mackey’s theory was applica- 
ble to the semi direct products and contained the Wigner theory as a special 
case. We are interested only in the case of real Lie groups. In the next essay 
we shall discuss these ideas when spacetime is nonarchimedean so that the 
groups involved are p-adic Lie groups. 

Let Go be any locally compact second countable group and &2 a locally 
compact second countable space on which Gy acts. A system of imprimitivity 
(SI) for Go based on Q is nothing more than a representation of Go together 
with data that encode the Go-action on 2. More precisely it is a pair (7, P) 
that consists of a unitary representation 7 of Go in a Hilbert space 1 and a 
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projection valued measure P on 92 in 1 such that 
(SD n(g)Pen(g) |= Pee (g € Go, E Borel C &). 


Apart from its intrinsic interest it is also a notion that is essential in formulat- 
ing the concept of covariance of quantum systems, as was first emphasized 
by Mackey. Suppose in fact that Q is the configuration space of a quan- 
tum system, and we want to formulate the idea that our description is to be 
symmetric with respect to Go. The functions on Q are the “coordinates” that 
determine the “configurations” of the system considered and so we should re- 
quire that we have “position operators” that correspond to them. This means 
that for each (Borel) subset E C Q we have a projection operator Pr in the 
quantum Hilbert space H the defines the experiment that decides whether the 
configuration of the system is located in E. The assignment 


Et Pr 


should be a projection valued measure. Then covariance with respect to Go 
means that there is a unitary representation U of Go in H. such that 


Up aie: ESE ys 


For instance, if & = R” and Go is the group of translations this leads to the 
Heisenberg-Weyl systems. From the point of view of quantum mechanics it 
is of great importance to know how to describe all systems of imprimitivity 
for a given (Q2, Go). 

We can also make a smooth version of this notion if we replace P by its 
smooth version, namely, a *-homomorphism 


Teas) 


of the algebra CQ) of smooth compactly supported functions on Q into the 
*-algebra of bounded operators in H, such that 


UpA(f)U,' = Asif) (g € G, f € CRD), 


where g| f] is the transform of the function f by the group element g. 

Actually, one can make a vast generalization of this as follows. We think of 
<2 not as a configuration space but as “spacetime.” For each open set E c Q 
we suppose that have a +-homomorphism 
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into the algebra of bounded operators in H., which encode the measurements 
we Carry out in E (which are in the quantum algebra associated to E). Then 
the covariance is formulated as the relation 


U.A(f)Us! = A(gLf I), 


where g[ f] is the transform of the function f by the group element g. The 
A(f) will not in general commute with each other but will do so, if the 
supports of f and g are space-like separated in the relativistic sense, namely, 
when light signals from one of them cannot reach the other. Then such 
a system will be the starting point of a quantum field theory, as was first 
discovered by Wightman. The fact that the operators A(f) commute only in 
a very restricted fashion makes the determination of such systems extremely 
hard. 

Let us come back to systems of imprimitivity. Even here we have to do 
some preliminary reduction because Go may not act transitively on Q. Since 
the conditions combine nicely when we form the direct sum of two systems 
associated to two actions of Go, it follows heuristically that any system is a 
direct sum (or integral) of systems that correspond to the orbits where the 
action is transitive. This is in fact an over simplification as it ignores the 
existence of ergodic actions. For instance, the classic Lebesgue action of Z 
on the circle T defined by an irrational rotation is not transitive but ergodic. 
In any case, one still has to understand systems of imprimitivity based on 
transitive actions before going on to treat ergodic systems. 

For a transitive action the story is very simple: if Ho is the stabilizer of 
some point wo of Q, the category of transitive systems on & is functorially 
equivalent to the category of unitary representations of Ho. This is Mackey’s 
great theorem, which is a culmination of a line of thought that started with 
Frobenius, and evolved through the ideas of von Neumann, Wigner, Gel’ fand, 
and possibly others. 

If o is a unitary representation of Ho in a Hilbert space K’, then one 
can associate canonically to o a SI (7°, P”) for Go based on 82: 7° is the 
representation of Go induced by o and P°® is the natural projection valued 
measure on Q in the space of 2°. Here we must recall that the Hilbert space 
H° of 1° is the space of (equivalence classes of) measurable functions f 
from Go to K° such that 
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(a) f(xé) = a (E)~! f (x) for each € € Ho for almost all x and 
(b) f Rages rer < oo (in view of (a) the K’-norm of f(x) is really 
defined on Q, and dx is the invariant measure on @.). 


The representation 7° acts by left translation. For any Borel E Cc & the 
projection P? is the map y +> xew (W € H”) where xz is the charac- 
teristic function of E. For the smooth version, where P” is replaced by A”, 
the operator A°(f) is multiplication by f: w +> fw. The imprimitivity 
theorem is the statement that 


or (m°, P°) or or (1°, A") 


is a functorial equivalence from the category of unitary representations of Ho 
to the category of SIs based on 2. The representation 7 is said to be induced 
by o. 

The above definition is not very intuitive and I shall now describe induced 
representations in a more heuristic manner. We think of A-” as “attached” 
to the point wy which defines the stabilizer Hp. If w = g[wo], we attach a 
Hilbert space % to w and a unitary isomorphism L, : ° —> K°. In this 
manner we have a “vector bundle” on & on which Gy acts. The induced 
representation is then the natural action of Gy on the space of sections of this 
bundle. One can make this rigorous as follows. We take Gy x K° on which 
Go acts from the left, and introduce the equivalence relation 


(g,W) ~ (g&, 0(€)'W). 
It is easy to see that the set 
Dee Go x | gs ~~ 


of equivalence classes can be interpreted as a vector bundle over Q whose 
fibers are isomorphic to K°. The action of Gy on Go x K° comes down to an 
action on V° and hence leads to an action on the space of sections of V7. The 
sections acquire a natural L?-norm and the space H° is just the subspace of 
square integrable sections. If Go is a Lie group and K° is finite dimensional 
(not necessarily unitary), the assignment 


Co =a 


The theory can be developed even if there is no invariant measure, but for most appli- 
cations invariant measures are available. 


5.2 Super imprimitivity theorem for even homogeneous spaces 133 


is just the classical functor that goes from the category of Hy-modules to the 
category of vector bundles with a Gy-action. Important geometric examples 
are the various tensor and spinor bundles on Q. It must be emphasized that 
the functor 


a 


(OF | ate 


is not an equivalence of categories in general, but the functor 
ot (n°, P”) 


is, according to the Mackey theorem. 

It is Well known how this theorem leads in a straightforward manner to the 
famous Mackey-Wigner classification of irreducible unitary representations 
(UIRs) of semidirect products Go = Ty x’ Lo where 7p is an abelian group 
with an action h, a +—> h[a] of Ly on it. A UR W of Go gives rise to pair 
(U. V) of URs of Ty and Lo by restriction which are coupled by the relation 


V(h)U(a)V(h)' = U(h[a}) (edge Ia) 


We diagonalize U by expressing it as an integral with respect to a projection 
valued measure P on the dual group 7 of 7), and the above relation translates 
into 

V(A)PeV(h)~| = Pye,  (h € Lo, E Borel C 7). 


The support of P is the spectrum of W. If W 1s irreducible, P is ergodic in 
the sense that for any H-invariant Bore] set E above, Pe = Oor J. Indeed, as 
h{E| = E, Pe willcommute with V by the above relation, while it commutes 
with U obviously, and so will commute with W. Suppose we assume that P 
is concentrated in a single orbit for Lo. Then by restricting to that orbit we 
obtain a transitive system of imprimitivity. If x is a point in the supporting 
orbit and L, is the stabilizer of x in Lo, we thus see that the (equivalence 
classes of) UIRs of L, classify up to equivalence the UIRs of Go whose 
spectrum is the given orbit. If the representation of L, is irreducible, so will 
the representation for Go, and vice versa. 

For the general case it might happen that there are ergodic systems whose 
spectrum is not concentrated in a single orbit. In this case there will be many 
more UIRs of Gy than are accounted for by UIRs of the various L,. This is 
however a situation not encountered in many natural applications. Suppose 
now we assume that the semidirect product is regular; this means that the 
action of Ly on Tp is nice in the Borel sense, i.e., there is a Borel cross section 
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for the orbits. In this case all ergodic systems have spectra concentrated in 
orbits. Every UIR of Go is then obtained by this method, by varying the orbit 
and the UIR of the L,. The groups L, are the little groups of the physics 
literature that go back to Wigner when he classified the UIRs of the Poincaré 
group in his famous paper in 1939 [3]. The general theory described above 
is due to Mackey [4]. 

Effros has proved the beautiful theorem that under very general conditions 
semidirect products are regular, for instance if the Lo-orbits in ie are all 
locally closed, namely, are open in their closures [5]. This is the case for 
instance when the actions are algebraic. 

I shall now formulate a generalization of the imprimitivity theorem to 
the super setting. This will lead to a classification of irreducible unitary 
representations of super semi direct products. To this end we work with a 
super Lie group G = (Go, g) and a closed super subgroup H = (Ho, h). We 
shall assume that the subgroup is special, namely that h; = gj), or that the 
odd part of § coincides with the odd part of g. This means that the associated 
homogeneous space is classical, namely 8 := G/H = Go/Hpo. It is an 
interesting problem to remove this restriction. However this condition is 
adequate enough to provide a solution to our main problem, that of classifying 
the irreducible unitary representations of regular super semidirect products. 
All the results in the physical literature are contained in this treatment since all 
super Poincaré groups and their variants encountered in physics are regular 
super semidirect products. 


The super imprimitivity theorem 


Let H = (Hp, 6) beaspecial subgroup of G = (Go, g). A super system of im- 
primitivity (SSI) for (Go, g) based on 2 = Go/ Hp is a system (77, p7, H, P) 
with the following properties: 


(1) The triple (7, op”, 1) is a unitary representation of the super Lie group 
(Go, g) 

(2) The triple (7, H, P) is a classical system of imprimitivity for Go in H, 
based on &2, P being even 

(3) The projection valued measure P commutes with p” ; this means that the 


spectral projections of the o7(X)(X ¢€ gi) commute with the projections 
Pete Borel © &), 
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The last condition, which is the new ingredient in the super setting, may be 
explained as follows. The projections of P give rise to *-homomorphism 
f +> A(f) of the commutative algebra of bounded Borel functions on Q 
into the algebra of bounded even operators on H. The additional requirement 
of compatibility between the odd part of the super representation and the 
projections of P may be formulated infinitesimally by saying that 


[o"(X), ACPI =ACX(f)) — (X E gi, f € CP(Q)); 


but, as Q = G/H = Go/ My is purely even, X f = 0 when X is odd and f is 
even. This is encoded by (3). 

We shall now associate to any unitary representation (a. p”. K° = K) of 
the super Lie group (Hp, h), a SSI of (Go. g). To this end we must define z 
and p”. We shall use the flexibility provided by Theorem 2 above and define 
the p”(X) on a suitable space B. We define 2 by 


me Go 
na = ind Ho? 


in the Hilbert space H introduced earlier. The super Hilbert space structure 
of K gives rise to a super Hilbert space structure for H and m2 becomes 
even. Lewie= C (r), thespacerof C™ veetors for m in H which, as 
functions from Gy to K, are smooth and have compact support mod Hp. 
It can be shown (as a consequence of the Dixmier-Malliavin theorem) that 
B is precisely the space of all smooth functions f from Go to K such that 
f(x&) = o (E)7! f(x)(x € Go. € € Ho) and have compact support mod Ho. 
It is then easy to see that the values of any f € B lie in C*(c) so that all the 
operators p7(X)(X € h; = g)) act onall the values of any f € B. We define 


(p™(X)f)(x) = p? (xX) f(x) (f € B, x € Go). 


Of course the projection valued measure is the standard one associated to 77. 
We then have the following theorem [la]. 


Theorem 5. The quadruple (7, p”.B, P) is a SSI for the super Lte group 
(Go, g) based on 2 = Go/ Ho. The assignment 


(o, 9°, K) +> (7, p”, P) 


isa functorial equivalence of categories from the category of unitary represen- 
tations of (Ho, h,) to the category of SSIs for (Go. g) based on S="Gol tor in 
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particular we have an equivalence preserving byection from UIRs of (Ao, 5) 
with irreducible SSIs for (Go, g) based on Go/ Ho. 


Remark. There are quite a few technical details to be verified in the proof 
of the Theorem. I refer the reader to [1a]. 


5.3 Super semidirect products and their unitary irreducible 
representations 


There are several places in the above treatment where we have made new def- 
initions based on heuristic considerations. It is now essential now to discuss 
applications and make sure that our definitions are justified at least ex posto 
facto. We shall do this by discussing UIRs of super semi direct products. 

We start with a classical semi direct product Go = Ty x’ Lo where Tp is a 
real finite dimensional vector space (the spacetime translation group) and Lo 
a closed unimodular subgroup of GL(7p) acting naturally on 7p. We shall 
assume that the semi direct product is regular, namely, that for the action of 
Lo on the dual 7 there is a Borel cross section; this is equivalent to saying 
that all the Lo-orbits in J) are locally closed (Effros). Let tp = Lie(7o), lo = 
Lie(Lo), 9o = Lie(Go). 

I have remarked that in the construction of super Lie algebras g = go @ g1 
the difficult condition to satisfy is the cubic nonlinearity 


[X.[X,.X]]} =O  (X €g)). 


One way to take care of this is to ensure that the odd bracket goes into a 
subspace g2 C go which acts trivially on g,: 


[92,91] =0 


The triple bracket condition above is then automatic. In what follows we 
must remember this. 

By a super translation group we mean a super Lie group (7p, t) where to is 
abelian and acts trivially on t,. Note that as [t;, t;] C t) and t acts trivially 
on t), the condition for a super Lie algebra is automatic. Suppose now that t, 
is an Lo-module and that the super commutator map a, b +> [a, b] is Lo- 
equivariant from t; x t; into to. Then g := lp @ tis a super Lie algebra with 
Go = lo ® to = Lie(Go), g; = t), for the same reasons as for t, namely, that 
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the odd bracket goes into tp which acts trivially on t;. The super Lie group 
S := (Go, g) is the super semi direct product of Lo and the super translation 
group (70, t). In this manner we have obtained a whole class of super Lie 
groups (which are algebraic if Lo and its actions on to and t; are algebraic); 
the data needed for their construction are Lo, and Lo-modules to, t), together 
with a symmetric Lo-equivariant bilinear map of t; x t; —> to. 

For any closed subgroup Sp C Lo we have Hp = Tp So is aclosed subgroup 
of Go and (Ho, h) is a special super Lie subgroup of (Go, g) where h = ho ty, 
ho being Lie(Ho); if $9 = Lie(So), then hp = $9 ® to. The super Poincaré 
groups that occur in the physical literature are special cases of the above 
construction where Lo is an orthogonal group (rather its two-fold cover, the 
spin group) with respect to a nondegenerate quadratic form on to, and t; 
is chosen as a spin module for Lo. But in this section we shall discuss 
the general case. We are interested in describing all the unitary irreducible 
representations of (Go, g). 

The aim is to show that the irreducible unitary representations of the su- 
per semi direct product may be classified by a straightforward extension of 
the Frobenius-Mackey-Wigner theory, namely by the method of little super 
groups. For any 4 € T; let L2 be the stabilizer of 4 in Lo and let 


g =O @g1, % =Lie(L>). 


We shall refer to the super Lie group S* := (ToL{, g*) as the little super 
group at X. It is a special sub super Lie group of S = (Go, g). Given 
a unitary representation (o, p”) of S*, we shall say that it is A-admissible 
if o(t) = e?7 for all t € Tp; A itself is called admissible if there is an 
irreducible unitary representation of S* which is A-admissible. Let 


t= {> eT; 


2 admissible | 


It is easy to see that J)” is Lo-invariant. 
Given a unitary representation (zr, p”) of the super Lie group S we obtain 
a spectral measure on P on T* by Fourier transformation of the restriction of 
mz to To: 
eS / e*dP(A) (t ET). 


If E is any Lo-invariant Borel set, the spectrum of the unitary representation 
is said to be in E if Pg = I. 
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Theorem 6. The spectrum of every irreducible unitary representation of the 
super Lie group S = (Go. g) is in some orbit in T,. For each orbit in 
T,’ and choice of i in that orbit, the assignment that takes a i-admissible 
unitary representation y ‘= (a. p°) of S * into the unitary representation U” 
of (Go. g) induced by it, is a functor which is an equivalence of categories 
between the category of the 2-admissible unitary representations of S* and 
the category of unitary representations of (Go. g) with their spectra in that 
orbit. Varying i in that orbit changes the functor into an equivalent one. 
In particular this functor gives a bijection between the respective sets of 
equivalence classes of irreducible unitary representations. 


Proof (Sketch). Since Ty acts trivially on g, it follows that for any uni- 
tary representation (7, 9”) of S the operators w(t)(t € 7) commute with 
p”(X)(X € g,) and so the spectral measure P and p” commute with each 
other. If (7, 7) is irreducible and E is a Ly-invariant Borel set, Pe commutes 
with z also, hence with (77, 0”), and so P, = 0 or /. Thus, by regularity 
of the semi direct product, Pp = / for some orbit O. If A € O we can 
view P as a projection valued measure on Ly/L%. Clearly (7. p”. P) is a 
SSI for S based on Lo/L’. By the super imprimitivity theorem this is the 
SSI associated to a unitary representation (co. 0”) of S*. From the classical 
theory one knows that o(t) = e!“"] forall t € Ty. If (7. p7) is irreducible, 
(a, p° ) is irreducible, and so 4 is admissible, i.e., A € T*. 


Remark 1. The reader should notice the sharp difference between the classical 
and super symmetric situations. In the classical theory, a// orbits in 7) are 
allowed, while in the SUSY case only those in 7,” are. Actually, when 
everything is even, 7," = 7; in fact, givenmany A, the mapas + er 
To. x € Li) is well defined and gives an one dimensional irreducible unitary 
representation of TyL4, thus showing that A € T,) . But when super symmetry 
is present, the restriction to 7," is a genuine selection rule. We shall see below 
that 7," may be interpreted as the condition of the positivity of energy. This 
is very different from the classical theory where representations of negative 
energy (tachyons) are theoretically possible and have to be excluded ad hoc. 


Remark 2. One other point should be noticed. If we restrict the UIR of (Go. g) 
to Go we obtain a UR of Go that is not in general irreducible. In principle the 
irreducibles in that restriction may belong to different orbits. But the theorem 
shows that all the irreducibles in that restriction belong to the same orbit. The 
reason for this is that the odd operators commute with the translations. This 
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means that all particles in a super multiplet, the so-called parters, have the 
same mass. Of course supersymmetry breaking will eventually endow the 
partner particles with differing masses. 


Suppose 4 € Ty and (0, p”) is a unitary representation of $* such that 

a(t) =e" for all t € T>. Then 
—ido(Z)=A(Z)I_ (Z € to). 
On the other hand, if X;, X2 € g), then [X;, X>] € to. Thus 
[e°(X1), p° (X2)] = 20)(X1, X21 (KX, X2 E 1) 
where ®) is the symmetric bilinear form on g, x g) defined by 
D,(X1, X2) = (1/2)A((X1, X2)). 
If we now take X; = X> = X we see that 
p(X)? = Q(X), Q(X) = 0,(X,X) (X € gi) 
on C“(a). Since p°(X) is essentially self adjoint on C™(c) it follows that 
OO) 20 (Xe): 


This is thus a necessary condition that there should exist a unitary represen- 
tation (o, p”) with o(t) = e!* J for all t € Tp. 
We also note that QO, 1s invariant under Lf because for X;, X21 €g),.h € Ly, 


ip" GaX deo? Hel) = G0" (2 p° alothy | = O7(%), Xr). 


Theorem 7. For ad € T; the following are equivalent. 


(a) There is an irreducible unitary representation (a, p°) of S* witha (t) = 
eT for allt € To, that is, that d is admissible. 
(b) There is a unitary representation (a, p°) ofS” with o(t) = eT for 
allt € To. 
(c) O,(X) = O forall X € g). 
Remark. When the super semi direct product is specialized to the super 
Poincaré group, the condition Q, > 0 is the same as the positivity of the 


energy. This is the reason why we refer to this condition as the positive 


energy condition in the general case. 
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Clifford structure when Q, > 0 


To complete the proof of Theorem 2 we must show that if Q, > 0 then there 
is an irreducible unitary representation (o, p”) of S* such that a(t) = ery 
for all t € Tp. Actually it is necessary to construct all such so that we can 
get a description of all irreducible unitary representations of the super semi 
direct product § = (Go, g). This depends on exploiting the Clifford structure 
that arises from the quadratic form Q,. The form Q, is invariant under Dg. 
Moreover the condition p?(X)? = Q,(X)/ shows that the operators p° (X) 
are bounded self adjoint so that the situation is much nicer than for the big 
group. We define C, as the quotient of the tensor algebra generated by g; with 
respect to the relations X* = Q)(X)(X € g)). If Q, is non degenerate this 
would be the Clifford algebra associated to the quadratic form Q,; by abuse 
of language we shall refer to it as the Clifford algebra associated to Q, even 
if Q) is singular. 

Clearly p° may now be viewed as a representation of C, in a super Hilbert 
space K such that p° (X) is odd, bounded, and self adjoint for all X € g, (SA 
representations). Now Li acts on g; and leaves Q, invariant. Hence its action 
lifts to an action x, a +> x[a] on C,. Then the unitary representations of S* 
are pairs (o, tT) where t is a SA representation and o a unitary representation 
of TL in the space of t such that 


o(t) = ez Elo), tela) = oc (Ghetx) | foraeC,xe Ge 


Description of the UIRs of S* 


While there is nothing really difficult in the determination of the pairs (0, T), 
it is technically involved and I do not want to go into all the details. This 
is because we are in the context of a completely general super semi direct 
product. One of the difficulties is that L, need not be connected. Another is 
that g; mod the radical of Q, may have an odd dimension. Neither of these 
arises for super Poincaré groups. Let us assume that Li is connected. 

Since Q; may have a radical, say t,, we introduce 


Gia = G/T). 


Then Q) defines a non degenerate quadratic form on g;, which we denote by 
Q,. Since li preserves Q) and is connected, we have a map 
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J 5 Lo ——> SO(g1a). 


Then there are two possibilities: (1) The map j lifts to a map (this ts the case 
when Li is simply connected) 


Fis Ee =———> Spintg),). 


(2) j does not lift as in (1) but there is a two-fold cover H* of 1b such that 
J lifts to a map 
Fie Hess Spin(gi,). 


In case (2) the covering map 
H* — Li 
has kernel {+1}. We write & for the non trivial element of this kernel. 


Lemma 1. We have the following. 

(a) There exist irreducible SA representations t of C,. These are finite di- 
mensional, unique if dim(g,,) ts odd, unique up to parity reversal if dim(g\,) 
is even. Any SA representation of C, is of the form 1 ®t. 


(b) Let t, be an irreducible SA representation of C,. Then, assuming that 
Li, is connected, there is an even UR x, of H*, unique up to multiplication 
by a character of H*, such that 


t (x[a]) = o(x)t,(a)o (x)! fora €C,,x € H%. 


The necessary and sufficient condition for x, to be a UR of Li, is that either 
j lifts to a map 1 —» Spin(g),) or there exists a character x of H* with 
x(&) = —1. This is the case if 


(C) LSA xT. A is simply connected, T is a torus.. 


. . . . . Xr 
Moreover k, is unique up to multiplication by a character of Lo. 


Proof (Sketch). (a) is a variant of the standard theory of Clifford algebras. 
Let us now look at (b). The spin group Spin(g;,) is imbedded inside the even 
part of the Clifford algebra of g), and 7, may be viewed as a representation 
of this Clifford algebra. Its restriction, u say, to the spin group Is than an even 
unitary representation of the spin group compatible with the action of gi,. If 
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now LA is simply connected, or more generally, if 7 lifts toa map j asin case 
(1), we may take ay = wow. If j exists only on a two-fold cover, K, = 40 j 
defines the unitary representation we seek. It may happen that me (Se = —Tit 
is +1): in this case changing x; to xx) we get a unitary representation which 
descends to 14, which we may take as «,. Finally, if Lj, satisfies (C), we can 
take H? = A x'T> where T> is atwo-fold cover of T, and € = (1, t) where 
1eT°.t £1. = 1. There exists a character x of T~ which is —1 at f, 
and we can extend x to H* by making it trivial on A. 


Theorem 8. Let) be such that Q, > Oand suppose that condition (C) on ee 
is satisfied. Let py, = Talay and let 0, = e'*k, be the unitary representation 
of ToL. We then have the following. 

(a) (ay, Py) isan irreducible unitary representation of the little super group 
S* which restricts to e'” on Tp. 

(b) The unitary representations (a, p”) restricting to e* on To are precisely 
those of the form 


=2 © Smee 0” =i, 
where 0 is a purely even unitary representtaion of Li. 


(c) The assignment 
OH> (80 @erK,.1® pr) 


isan equivalence of categories from the category of purely even unitary rep- 
resentations of L to the category of unitary representations of the little super 
group S* which restrict to e'* on To. 


Remark. The pair (e'*«,. ,), which corresponds to the case of trivial 6, 
is called the fiwidamental representation of S*. Once it is determined, the 
whole structure of the irreducible unitary representations of the original super 
semi direct product S is made transparent. The determination of «, and the 
verification of the condition (C) are thus the crucial ingredients that remain 
to be described in the super Poincaré case. 


Generalizations 


It is a natural question to ask whether we can develop the theory of infinite 
dimensional representations of more general classes of super Lie groups. 
There have been many cases treated in the literature, much prior to [la]. 
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Jakobsen [11] constructed all highest weight unitary modules for classical 
super Lie algebras, while Dobrev et al [12] studied the representations of 
the conformal supergroups. Recently Salmasian [13] has extended the entire 
Kirillov theory and obtained a complete theory of unitary representations of 
nilpotent super Lie groups. It is clear that there is a whole lot to be done in 
these directions. 


5.4 Super Poincaré Lie algebras and Lie groups 


It is probably a good idea now to look at the groups that are studied extensively 
by the physicists. These are the various super Poincaré groups and algebras 
and variants of them with additional charges, central or otherwise. 

The Poincaré group is a semi direct product of the group of spacetime 
translations and the Lorentz group. For applying supersymmetry to physical 
problems such as the structure and classification of particles with supersym- 
metry it is very useful to have supersymmetric extensions of the translation 
and Poincaré groups. We begin with the translation group. We use the the- 
orem of super Harish-Chandra pairs and think of a super Lie group G as a 
super Harish-Chandra pair (Go, g). 


We have already defined the super semidirect products. The super Poincaré 
groups occurring in physics are very much more special than these. First of 
all to is required to have a nondegenerate symmetric quadratic form, namely, a 
metric, and Lo must be the corresponding orthogonal group or the spin group. 
Furthermore from the beginning of the discovery of supersymmetry, the space 
t; was supposed to be spanned by the so-called fermionic or spin charges. 
Essentially what this means is that t; must have a spin structure, or as we shall 
see presently, spinorial. Because the spin modules are not modules for the 
orthogonal groups but their two-fold covrs, namely the spin groups, we must 
assume that Lo is the spin group (although in the physical literature the groups 
never come in). The source of this restriction on t; can be traced to the fact 
that in quantum field theory the objects obeying the anticommutation rules 
are the spinor fields. To explain how special these groups are, even within the 
category of super semidirect products, let me go back to the 1970's. The first 
example of a super Lie algebra, the minimal supersymmetric extension of the 
Poincaré algebra, emerged at that time from the works of the Russian school, 
followed a little later by the construction, by Wess and Zumino, of the first 
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example of a simple super Lie algebra, and the general definition of super Lie 
algebras and the classification of the simple ones by Kac (see [hO, ligeheel3] 
of previous essay). The initial idea was to construct a super Lie algebra 


G=9 091 


with the following properties: 


(a) g is a real Lie super algebra and go is the Poincaré algebra of 4- 
dimensional Minkowski spacetime. 

(b) g; is a spinorial go-module which is irreducible over R. 

(c) The spacetime momenta should be among the super commutators 
[A. B] where A. B € gj, i.e., [g). g;] should contain the translation sub- 
space to of go. 

(d) g should be minimal in the sense that there is no super Lie algebra 4 
such that go C h C g where both inclusions are proper. 


The condition (b) includes the requirement that go acts on t; through [y so that 
ty acts as zero. The minimality in (d) forces that t, is irreducible as a module 
for ly (over the real field); indeed, if t2 is a submodule, go @ ts is a super Lie 
algebra satisfying (a)-(c). The complexification of [) is a direct sum of two 
copies of s{(2, C) and so the irreducible representations of [) are of the form 
k @ k°” where k is the irreducible representation of 5{(2. C) of dimension 
k. Spinoriality means that the complexification of t; must be a direct sum of 
the 2 and 2°"! Now 2 @ 2°" has a real form which is therefore the unique 
irreducible real spinorial module for [o; it is called the Majorana spinor by 
the physicists. There is a unique (up to a multiplicative scalar) symmetric 
Lo-equivariant map t; ® t; —> to which then can be used for the bracket of 
the elements of t;. This is the construction of the super Poincaré algebra. 

The physicists take an explicit realization of 2 and the odd bracket, and 
write down the commutation rules for g. Itis more convenient for me to write 
these in the following basis-independent form. Let the elements of C? be 
viewed as column vectors and let 


i= (Ge wlee C| he ={, 2) ae) 


with SL(2, C)-actions given by 


8, (u, u) > (gu, gi) 8, (u, v) H> (gu, gv). 
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Then t; is a real form of t)¢ stable under this action, while tic is2@ 2. We 
take tg to be the space of 2 x 2 hermitian matrices represented as 


Say os X} tix, 


9 2 
- ) det(x) = xy _ oe, —X5 — X3 
AN LOOM) ae 3 a : 


with Lo and its action on ty given by 
Lj see) gti exe. 
Here the exponent 7 refers to the transposed matrix. The Lo-equivariant 


symmetric form t; x t; — > to is then of the form 


[(u, a), (v, v)] = sua" wn): 


Actually we can have any real constant in place of 1/2 since only its sign 
matters for the isomorphism class of g = go ® t; where go = to ® Ip. We 
take the factor to be 1/2 (the negative sign gives a super Lie algebra which 
may be called an isomer of the one we use). This gives the super Poincaré 
algebra used in physics. The choice of the positive sign implies that the 
super bracket has a very nice positivity property. To formulate it let us first 
note that the forward light cone C~ in to is given in the usual coordinates as 
= 0: er — is — xg — i > 0 which reduces to Tr(x) > 0, det(x) > 0 or 
equivalently, that 


Ct = {x € to|x 4 0 is nonnegative definite}. 
We have 
(supercommutator) De. @ uu? A =n) et 


so that 
[X, X] e Ct forallO 4 X € t. 


Since any X € ty which is nonnegeative definite can be written as uu! for 
some u € C’, it follows that 


Xtr> [X, X] 


maps t; \ {0} onto C*. 
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Let P,, be the standard basis of Minkowski space. In the matrix identifi- 
cation, Py = 1, the identity matrix. If e;, e2 are the standard basis vectors of 
C?, and X; = (e;, e;), then 


Po = [X,, X;] + [X2, X2]. 


In any representation, Po gets mapped to (X1)2 + p(Xs)” whiehnis = 0) 
This is the positivity of the energy. 

If 6, are the coordinates with respect to a basis for t;, then x,,, 9, are the 
coordinates for the super spacetime t. The superfields, namely the sections 
of the structure sheaf ©;,, are thus of the form 


O(x, 8) = so(x) + yaa + SS Sap9aOp 
a a<B 
+ >> SapyGapy + 5123401920304. 


a<B<y 


where the s’s are functions of the x’s. Using the differential and integral 
calculus on super manifolds we can then introduce super Lagrangians and do 
superfield theory. 

It is in this framework, and on this 4|4-dimensional spacetime, that Salam 
and Strathdee (see [4a, 4b] of previous essay) defined the superfields and 
analyzed their transformation properties under the super Poincaré group and 
classified the super particles. In the physics literature, the technique of work- 
ing with super spacetimes is known as the method of superfields. 

The construction of super spacetimes and super Poincaré groups can be 
extended to the case when tg has arbitrary dimension D and Minkowski 
signature, and t; is not necessarily irreducible. The key is to show that there 
are always nontrivial Lo-equivariant symmetric bilinear maps 


t; x ty —> to, 


and indeed, which are in addition positive in the sense described above. One 
can classify all the symmetric equivariant maps; the positive ones form a 
nonempty open cone. This leads to a classification of all super Poincaré 
algebras. We shall not go into this classification here. See [7], pp. 90-94, 
258-262, as also [8]. If N is the number of irreducible spin modules in g, 
physicists speak of N-extended super symmetry. The super Poincaré groups 
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are then the semi direct products of the classical group Ly with the super 
spacetimes. 

For deeper insights into supersymmetric physics one has to have a theory 
of super homogeneous spaces. For the construction of the general super 
homogeneous space, see [14]. One can use this theory as exemplified in loc. 
cit. to construct super conformal spacetimes. 


Spinors 


I have passed over in silence the highly nontrivial theory of spinors on which 
the construction of al/ super Poincaré algebras in arbitrary dimension and 
(Minkowskian) signature depend. However a few brief comments may be in 
order. The spin groups are the universal covering groups of the orthogonal 
groups, and the spin representations are certain special representations of the 
spin groups, determined by the Clifford algebra. The basic theory is over C, 
but one needs the version of this theory over R so that it can be usefully applied 
to physical problems. For general references see [6, 7,8, 9]. Physicists mostly 
ignore the spin groups and work infinitesimally; the Clifford algebra appears 
in their work through the yoga of the Dirac gamma matrices. 

The spin representations were originally discovered by Elie Cartan when 
he classified the irreducible representations of the simple Lie algebras in 
1913. For these the basic building blocks are the fundamental representa- 
tions canonically associated to the nodes of their Dynkin diagrams. The spin 
representations are the ones for the orthogonal algebras associated to the right 
extreme nodes. For s0(2k)(k > 3) there are two of these, S*, of dimension 
2‘-! while for so(2k + 1) there is just one, denoted by S, of dimension 2'. 
For 50(4) which is only semi simple and ~ s0(3) © s0(3), the diagram con- 
sists of two nodes which are not linked, and either of the two nodes may serve 
as the one defining the representation. Thus the two spin representations of 
s0(4) are the 2 @ 1 and | ® 2 encountered before. Cartan noticed that the 
spin modules do not occur in the decomposition of the rensors over the space 
on which the orthogonal groups are defined. Then in the 1930’s when Dirac 
discovered his famous relativistic equations of the electron, he rediscovered 
these representations (in the special case of $0(4)) and so S*, S came to be 
known as spin representations, the elements of the spaces on which they act 
were then called spinors. Initially interest was centered only in the case of 
dimension 4 but this changed after Brauer and Wey! wrote their famous paper 
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on spinors in higher dimensions [6]. The higher dimensions arise in physics 
in models where spacetime is extended by multiplication with very small (!) 
compact Riemannian manifolds. 

Let me now explain the connections with the notion of spin and the Dirac 
equation. When Dirac wanted to write a first order relativistic equation on 
flat Minkowski spacetime it was natural for him to look for it as a square 
root of the wave operator. He discovered that a square root does not exist 
over the complex numbers but does exist over the Clifford numbers (Clifford 
algebra). The first order operator with constant coefficients, called the Dirac 
operator, whose coefficients are in a representation (minimal) of the Clifford 
algebra, has therefore 4 x 4 matrices as coefficients, and so can operate only 
on 4-component wave functions, and these components describe the spin of 
the electron. This is the situation alluded to at the beginning of this section. 
However it was not enough to construct the Dirac operator over the Clifford 
algebra; it was also necessary to find out how it transformed under the Lorentz 
transformations. Dirac discovered that one has to go to the two-fold cover of 
the Lorentz group to write down the transformation law. The two-fold cover is 
naturally called the spin group, and its representation, the spin representation. 
It is coupled with the Clifford algebra right from the outset. In the modern 
theory, the spin group is imbedded inside the even part of the Clifford algebra. 

The generalization of these ideas to arbitrary dimension offers no obstacle. 
But defining the Dirac operator over a curved spacetime turned out to be 
more subtle. The reason is that the spinor spaces associated to a quadratic 
vector space V do not vary functorially with V. The attempt to formulate the 
Dirac equations and spinor fields over arbitrary Riemannian and Minkowskian 
manifolds led to the discovery that there are global topological obstructions to 
defining them. On a Riemannian (resp. Minkowskian) manifold we always 
have the Laplacian (resp. the D’Alembertian, wave) operator. The Dirac 
operator, being a square root of these, needs a finer structure to make sense, 
namely, the spin structure. When the obstructions vanish, the manifold is 
called a spin manifold. 

The definition of spin modules given at the beginning, although historically 
correct, is not satisfactory in the sense that it does not allow us to explore their 
structure in detail. To get to this we introduce the Clifford algebra C(V) = C 
of the complex quadratic vector space V (which is just a vector space equipped 
with a nondegeberate quadratic form). The algebra C is a super algebra and 
the Lie algebra g of the orthogonal group has a natural imbedding inside the 
even part Co of C whose image generates Cy. Thus Co is an enveloping 
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algebra of g. The spin modules can then be proved to be precisely those 
modules for g which extend to Co; the proof is nontrivial, see Chapter 5 of 
[7], and [8]. Thus the spin modules can be identified with Co-modules, the 
so-called Clifford modules. It is this identification that leads to the possibility 
of a very detailed study of spin modules. If (e;) is an orthonormal basis for 
V, the Clifford algebra C is generated by the e; with relations 


eje; t+eje; = 25; (i <i, j <n). 


In any representation of C the e; are represented by matrices y; satisfying the 
commutation rules 


ViVji+yviyi = 26; AUAs<i,j sn) 


which are called (after Dirac’s pioneering constructions and notations) the 
Dirac gamma matrices. The yoga of gamma matrices is nothing but a coor- 
dinate version of the theory of Clifford algebras. 

The spin representations are defined over the complex numbers. In physics 
one works over the real field and it becomes important to know what happens 
then. As we have done above we call a real representation of the orthogo- 
nal Lie algebra spinorial if it complexifies to a direct sum of spin modules. 
For instance the space t; in the super Poincaré Lie algebra is spinorial as it 
complexifies to 2 ® 2. Now if we have a finite dimensional irreducible real 
representation of some real associative algebra, its commutant is a division 
algebra over R and so is one of R, C, H by a classical theorem due to Frobe- 
nius; here C is treated as a real algebra, and H is the algebra of quaternions. 
Accordingly, the real representation is said to be of type R, C, H respectively. 
In the case above, the Majorana spinor is of type C because its commutant, 
when complexified, is the commutant of a direct sum of two inequivalent rep- 
resentations, and hence has dimension 2. It turns out that for the general case 
of V of dimension of signature s, the types of the real irreducible spinorial 
modules depend only on the signature s mod 8 and are given by the following 
table (see [7, 8] for a derivation of these tables). 
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Space of arbitrary signature s 


s mod 8 type 
0 R, R 
eae R 
2,6 6 
3, 5 H 
4 H. H 
For Minkowski signature in dimension D, we have s = —(D — 2), and the 


above table becomes the following. 


Minkowski space of dimension D 


D mod 8 type 
0, 4 Cc 
Lod R 
2 Rik 
Da H 
6 Hel 


One is also interested in supersymmetric Lagrangians and mass terms. These 
depend on the structure of real invariant forms and maps. There is a corre- 
sponding table but it is now determined by bot/ dimension and signature mod 
8. The physicists rely on the famous Regge clock whose two hands describe 
the dimension and signature mod 8. See [7, 8] for the full theory (Chapters 5 
and 6) and references. The periodicity mod 8 is a real marvel of this theory. 
For a theory of spinors over any field, see Chevalley’s great treatise [9]. For 
the Regge clock see [10]. For a general treatment of physics and geometry, 
see(15]- 
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The theory discussed above applies at once for § = (Gy. g) a super Poincaré 
group. Because of the Minkowski form we can identify t} with to itself. For 
p € I‘ the stabilizer L/ is given by 
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Spm(l p=) if (p, p) > 
ee Roe x Sping) — 2) te p) = 0, po > 0 


Thus, except when D = 4, py > 0. (p, p) = 0, the stabilizer L?’ is simply 
connected; in the exceptional case of zero mass in 4 space time dimensions, 
the stabilizer is R? x' T where T is a circle group and so still falls within the 
condition (C) described in Section 5.2. 

The key lemma which shows the nature of T* in this case is the following. 


Lemma 2. We have 
QO, =0 <> po = 0, (p, p) =, 


Proof (Sketch). If v € (T*)®, (v. (X. X]) > 0 for allO 4 X € g, by the 
structure of the super Poincaré algebra (positivity of the odd commutators). 
By going to the limit we see that for p € ['*, (p. [X. X]) = 0 forall X € gy. 
This is just the statement that Q,, > 0. For the converse, suppose that Q,, > 0 
but (p, p) < 0. Then Lf = Spin(1, D — 2). On the other hand we have a 
map Lj; —> SO(gj,). Since Spin(1, D — 2) is simple and non compact it 
cannot have a nontrivial map into a compact group. Hence Lj acts trivially 
on g;,,. Thus the action of L{) on g; contains the trivial representation of ibe 
But Lo acts as a spin module and it is not difficult to show that the action of 
Li is asum of spin modules, and so cannot contain the trivial representation. 


Theorem 9. The irreducible unitary representations of a super Poincaré 
group S = (Gy. g) are parametrized by the orbits of p with py = 0. (p, p) = 
0, and for such p, by irreducible unitary representations of the stabilizer Lo 
at p. Lett, be an irreducible SA representation of the Clifford algebra C, 
and let k,, be the representation of Li, in the space of t,, defined earlier. Then, 
for any irreducible unitary representation 6 of Li, the pair (a. p” ) defined by 


Co = e'?6 @ Kp, p° (XxX) =1@ t)(X) (Xx S 91) 


is an irreducible unitary representation of the little super group S? = 
(TyL}), to Bly @gi), and all irreducible unitary representations of S” are ob- 
tained in this manner. The unitary representation © pv of the super Lie group 
S induced by it is irreducible and all irreducible unttary representations of 
S are obtained in this manner, the correspondence (p.9) —> Ojw being 


bijective up to equivalence. 
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Remark 3. The reader can see that in the presence of supersymmetry, only 
representations of positive energy come in, in contrast to the classical situation 
where we could not exclude the tachyonic orbits. Otherwise the reduction to 
the little group is exactly as before, parametrized by the irreducible unitary 
representations of L}’, except that it has to be tensored with the pair («p, Tp, y 
Thus the determination of («,,, T,) is the basic new ingredient. 


Remark 4. The correspondence with representations of the little group allows 
us to treat zero mass super particles with infinite spin, somehting not available 
in the physics literature. Also the SUSY transformations are given on the full 
space of super particle states, not only on those with a fixed momentum. 


5.6 Super particles and their multiplet structure 


The irreducible unitary representations of the super Poincaré group are the 
models for super particles. If we restrict the representation of Go, it will 
in general decompose into a direct sum of particles and this set is called the 
multiplet defined by the super particle. The operators p(X)(X € g)) then 
give the transformations between particles of different spin parities. The 
fundamental muptiplet is the one corresponding to the trivial representation 
of Lj. The irreducible unitary representations into which «, splits then define 
the classical particles of the fundamental multiplet. 

The calculation of «, can be carried out very explicitly. When D = 4 and 
g; is the Majorana spinor, the multiplet with mass m > 0 has the spins 


(i,j, + 1/2, 7 —1/2G > 0) 
{0501/2} = 0). 


For m = 0 the mulitplet has spins (helicities) 
{n/2, (@ + 1)/2}. 


These results are the source for the prediction that supersymmetry forces the 
known elementary particles to have super partners. Whether these partners 
can be seen when the new LHC opens is at best speculative, since we do 


not know the threshold of supersymmetry breaking in the development of the 
early universe. 
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Chapter 6 
Nonarchimedean physics* 


We will argue that for a physical theory of quanta there 
are only two new possibilities, namely the field of p-adic 
numbers or a finite Galois field. 

Igor Volovich 


There is a natural time scale, the Planck scale, that emerges when general relativity and 
quantum mechanics are both significant. No measurements are possible in regions of 
smaller sizes than the Planck units. In particular we cannot compare distances and times 
in sub-Planckian domains. In 1987 Igor Volovich proposed the bold hypothesis that in 
such domains the geometry of spacetime is nonarchimedean. Itis then natural to ask what 


are the consequences of this hypothesis. We address this question in the Dirac mode. 


6.1. Planck scale and the Volovich hypothesis 

6.2. Nonarchimedean symmetry groups and their multipliers 
6.3. Elementary particles over nonarchimedean fields 

6.4. Nonarchimedean quantum field theory 


6.1 Planck scale and the Volovich hypothesis 


That the structure of spacetime itself could be the subject of investigation is an 
idea that dates back to the 1970’s, when many physicists, concerned about the 
divergences in quantum field theories, started exploring the micro-structure of 
spacetime as a possible source of these problems. In particular, Beltrametti 
and his collaborators proposed the idea in [1, 2, 3] that the geometry of 
spacetime might be based on a nonarchimedean, or even a finite, field and 
examined some of the consequences of this hypothesis. But the idea did not 
really take off until Volovich [4, 5] proposed in 1987 that world geometry 


* This essay is based on various lectures I have given for the past several years. I especially 
want to mention conferences in Goslar, in Dubna, in Moscow, and a series of lectures in 
the University of Arizona, sponsored by my friend Hermann Flaschka. 
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in regimes smaller than the Planck units might be nonarchimedean. The 
reasoning behind this hypothesis is this: since no measurements are possible 
at such ultra-small distances and time scales due to the interplay between 
general relativity and quantum theory, comparisons of distances and time 
spans are also ruled out. 

The idea of a fundamental scale emerges naturally when one attempts to 
unify gravitation and quantum theory. This scale, called the Planck scale, is 
one below which it seems that no measurements can be carried out. The idea of 
extended particles (strings, branes) also makes sense in this context. Indeed, 
the Planck scale comes up immediately when one identifies the Schwarzchild 
radius of a particle, basic in general relativity, with its Compton wave length, 
fundamental in quantum theory. This leads to the order of Planck length as 
10-*3 cm and Planck time (time taken by light to travel Planck length) as 
1O= > sce. 

The Planck scale actually goes back all the way to Planck himself. He 
wanted to obtain a fundamental scale entirely in terms of the 3 constants 
h, G, c where G is the gravitational constant. We have 


=2 2 = 

h~6x 10-7’ cm? gm sec”! 
= = eh | 
G ~ 6.7 x 10% cm? gm! sec? 


c~3x 10!9 em sec™'. 


It follows after a simple calculation that we have the following quantities 
which have the dimensions of length, mass, and time, the so-called Planck 
units, which form the Planck scale: 


hen 
Mp, = (=) 17 gm 


hG\'/ 
ip = (*F) ~ 10 * sec. 


From this point of view, a Planck particle is a black hole whose event horizon 
has a radius equal to its Compton wave length, as we described earlier. 
Since impossibility of measurements automatically forbids also compar- 
isons between different lengths and also different times, the Volovich hypoth- 
esis 1s very natural. Now nonarchimedean geometries arise when we work 
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over the field O55 of p-adic numbers, p being a rational prime > 2. Since 
no single prime can be given distinguished status, it is perhaps even more 
natural to see if one could really work with an adelic geometry as the basis 
for space-time. Such an idea was first proposed by Manin [6]. A huge number 
of articles have appeared since then, exploring these and related themes. For 
a definitive survey and a very inclusive set of references see the very recent 
article by Dragovich et al [7}. 

The simplest way to enforce nonarchimedean geometry is to assume that 
spacetime is a manifold over Q,,. Over the reals Wigner associated ele- 
mentary particles with irreducible projective unitary representations of the 
Poincaré group, and classified these in his epoch-making paper of 1939. In 
the paradigm of modern quantum field theory one goes beyond the Wigner 
classification and defines elementary particles as the quanta of quantum fields 
[8]. However, over the p-adic numbers it has not yet been found possible to 
develop much intuition about p-adic quantum fields that are fully covariant, 
for many reasons, such as the absence of a Lagrangian picture. So one has 
to be content with thinking of elementary particles as defined by irreducible 
projective unitary representations of the symmetry groups, the p-adic ana- 
logues of the real Poincaré and Galilean groups. This means also that one 
has to define the spacetime symmetry groups as p-adic Lie groups and study 
their projective representations. 

There are many studies designed to show that at Planckian distances and 
times spacetime itself undergoes fluctuations. It is quite plausible that such 
fluctuations change the ground field from R to the Q,,. The themes discussed 
in this essay are thus of interest, at least in the Dirac mode whose essence 1s: 
do the mathematics first and then look for physical interpretation. 


6.2 Nonarchimedean symmetry groups and their multipliers 


How do we formulate the notion that a quantum system has a symmetry 
group G? A symmetry of a quantum system is a bijection of the projective 
space P(H) of the Hilbert space 1 of the system preserving the transition 
probability |(@, w)|? between two states [], [y] defined by unit vectors @ 
and yw. It is aclassical theorem of Wigner and Eckhart that if o is a symmetry, 
then o is of the form 


[p] +> [RO] 
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where R = R, is aunitary or anti-unitary operator; R is uniquely determined 
by the symmetry up to a phase factor. So, if 0, 0), 02 are symmetries with 
GO = 0;02, and R, R;, R2 are operators inducing them, we cannot be sure 
that R = R,R> but only that R = mR; R>2 where m is a complex number of 
absolute value |. It follows from this that the symmetry of a quantum system 
with respect to a group G, locally compact and second countable (Icsc), may 
be expressed by a projective unitary representation (PUR), either of G or of 
a subgroup of index 2 in G, in the Hilbert space of quantum states. Here 
we must recall that a PUR, to be thought of as a map of the group into the 
projective unitary group of a Hilbert space, may be viewed as a Borel map 
x +> U(x) of G into the unitary group of 1 such that 


U(xnuU yam, yy Cee, U()je=1 @epmeG). 


The function m, with values in the circle group, is called a multiplier for U, 
and U is called an m-representation. It satisfies certain identities coming from 
the associativity of multiplication and we call any such function a multiplier; 
the point is that every such function arises as the multiplier of some projective 
representation. The relations satisfied by m are 


m(xy, 2m(X, y) =n v2 2). m(x, |) =7eGia) =. 
The projective representation does not change if we make the change 
U(x) > a(x)U (x); 
but in this case, m changes to m’ where 
m’'(x, y) = a(x)a(y)a(xy)7'm(x, y). 


We write m ~ m’. The multipliers form an abelian group under point-wise 
multiplication, those ~ | form a subgroup, and the quotient is the multiplier 
group H?(G) of G. This is an important (cohomological) invariant of G and 
its determination is basic to all investigations. 

The insistence on Borel multipliers is not out of a desire for generality. 
On general grounds projective representations of a group G are the same 
as ordinary representations of a central extension E of G by a circle group 
T. However the map E — G may not have continuous sections. It was 
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Mackey who discovered that Borel sections always exist giving rise to Borel 
multipliers. 

If U is an m-representation we introduce the group E,, = T x G, T being 
the group of complex numbers of modulus 1, with the multiplication law 


(Ss, x)(t, y) = (stm(x, y), xy). 


It is easy to verify that the properties of m guarantee that E,, is a group, with 
natural maps 


T— En, tr (t,1) Em — G, (8,xX)hR > x. 
At the level of abstract groups we have a central extension 
1—> T — E,, — G — 1. 


Then 
V : (5s, x) > sU(x) 


is an ordinary representation of E,,,, giving us a way to remove the multiplier 
by passing to a central extension of G. But the topologies are missing; we 
have not given a recipe for viewing E£,, as a topological group, much less as 
an Icsc group. If m is continuous, then we can use the product topology for 
E,,; otherwise we use a beautiful theorem of Mackey that there is a unique 
topology for E,,, in which it is lcsc and the above is a central extension of 
Icsc groups. The topology is the Weil topology; we consider the Hilbert space 
L?(E,,) with respect to the measure dtdx where dr is the usual Lebesgue 
measure on 7 and dx is a left Haar measures on G, and give the topology for 
E,, in which the left regular representation of E,,, is a homeomorphism into 
the unitary group of L7(E,,) with its strong operator topology [9]. 

The central extension E,, depends on m and so it is reasonable to ask if 
we can find a universal central extension G* of G with the property that all 
PURs of G are unitarizable when lifted to G*. This is not always possible; 
the commutator subgroup of G must be dense in G for a universal topological 
central extension (UTCE) to exist. For the usual Poincaré group P = R* x’ 
SO(|, 3)°, E. Wigner [9, 10], proved that all PURs of the connected real 
Poincaré group P lift to URs of the simply connected covering group P* = 
R' x’Spin(R!) of P where x’ denotes semidirect product. In other words, 
P* is already the UTCE of the Poincaré group. 


160 6 Nonarchimedean physics 


Central extensions go back at least to Schur who calculated them for the 
symmetric and alternating group. Weyl [11] was the first to emphasize the 
importance of PURs in quantum theory. He called them ray representations. 
Bargmann [12] extended Wigner’s work to all dimensions, besides giving 
a systematic treatment of multipliers of Lie groups and treating the case 
of the Galilean groups also. For a deep and systematic treatment of the 
Galilean group see [13]. For varied aspects of the theory of multipliers see 
(14,15, 16, 17, teaetSby- 

The situation over a disconnected field is quite different. Over a nonar- 
chimedean local field k , let G be an algebraic group defined over k and G(k) 
the Icsc group of its k-points. The commutator subgroups of G(k) are often 
open and closed subgroups of G(k), and it is generally a very delicate matter 
to verify whether they are equal to G(k). So it is preferable to work with the 
PURs of G(k) itself, rather than look for UTCEs of G(k). Number theorists 
have long been interested in URs of groups G(k) for simple groups G. In 
physics, groups with radical appear to be important and so it is worthwhile 
to study PURs of these groups as well. The groups G(k) are all totally dis- 
connected. For totally disconnected groups David Wigner proved that all 
multipliers are equivalent to continuous ones [14]. 

Let k = Q,. Actually most of what I say will remain valid if the char- 
acteristic of k is # 2. Let me introduce the Poincaré group. Let V be 
an isotropic quadratic vector space over k; this means that V has a non- 
degenerate quadratic form defined over k which has null vectors over k. 
Then we have the algebraic groups G = SO(V) and its two-fold algebraic 
cover Gy»in = Spin(V). We thus have correspondingly the Poincaré groups 
P= V x’ Gand P,pin = V x! Gopin. Write V. G and G,,;, for the groups 
of k-points of V, G, and G,,;,, respectively, and P, Pepgtor the. respective 
groups of k-points of P, P,,,;,. Unlike the case when k = R, we can no longer 
speak of the signature but the Witt class of V. 

Moore [15] first calculated the UTCEs of simple groups split over k 
(Chevalley groups), and then Prasad and Raghunathan [16] extended his re- 
sults (amd much more) for nonsplit but isotropic groups. Now Gspin and 
Pspin do have TCE’s, for the spin groups this is a consequence of the work 
of Moore and Prasad and Raghunathan, and for the corresponding Poincaré 
groups, of the work of Varadarajan [17]. Moreover, if CO is the universal 
TCE of Gyn, itis shown in [17] that the universal TCE P* 


* ak ; spin of Pspin is given 
by Ponin = VX" Gyyin- SO all PURS of Pepin lift to URs of P*,,,, and since P* 


is a semidirect product, the Mackey-Wigner theory is applicable. We are thus 
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in the same situation as in the real case and there are no fundamental obstacles 
to the classification of the particles (=irreducible PURs) with Psnin-Symmetry. 

However the natural maps Gspin —> Gand P;,in —> P are not surjective 
(even though they are surjective over the algebraic closure of k), and so 
replacing the orthogonal group G by the spin group G,,;, leads to a loss 
of information. So we work with the orthogonal group rather than the spin 
group. To illustrate this point, letG = SL(2, Q,,). The adjoint representation 
exhibits G as the spin group corresponding to the quadratic vector space g 
which is the Lie algebra of G equipped with the Killing form. The adjoint map 
G — G, = SO(g) is the spin covering for SO(g) but this is not surjective; 
in the standard basis 


ee), fe). r= (0%) 


the spin covering map is 


ap Geb =p" 
(: i) +t > | —acad+bc bd 
Sood «d= 
The matrix 
aQ 0 
Oal:0 
00a7! 


; Pe b 
is in SO(g); if it is the image of (: a then b = c = Oanda = a’, so that 
c 


unless a@ € Oe this will not happen. 


The group P is still a semidirect product but we are now required to 
determine its irreducible projective URs. This means that we must deter- 
mine its multipliers and then, to each multiplier m, find the irreducible m- 
representations. The multipliers of a semidirect product were determined by 
Mackey long ago [17, 18]. It turns out further that there is a very nice variant 
of the Mackey machine for m-representations of a seimidirect product that 
allows us to do this [19], which appears not to have been noticed in previous 
literature. The variant is formulated in the framework of locally compact 
groups and so is applicable to adelic geometries as well, but here we restrict 
ourselves to the local case. 
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I shall now describe Mackey’s beautiful formula for multipliers of semidirect 
products. Let H = A x’ G where A and G are Icsc groups and A is abelian. 
Let A* be the character group of A. We need to introduce H'(G, A*) which 
we do as usual. An 1-cocycle for G with coefficients in A* is a Borel map 
f(G —» A*) such that 


f(ge') = f(g) +alfle)] (g, 8° €G) 


or equivalently that g +» (f(g). g) is a Borel homomorphism of G into 
the semidirect product A* x’ G. Thus all !-cocycles are continuous. The 
abelian group of continuous I-cocycles is Z'(G. A*) and the cobound- 
aries are cocycles of the form g / g[x] — x for some x € A*. These 
form a subgroup B'(G, A*) of Z'(G. A*) to give the cohomology group 
HG, A) = 7G A) BG A). 

In practice we are primarily interested not in the whole of H*(H) but only 
in M,(#), the group of multipliers on H that are trivial when restricted to 
A x A. Here we must remember [11] that abelian groups can have nontrivial 
multiplier groups and so the above requirement is really a condition on the way 
that G operates on A; I shall comment on this presently. Let F2GED) denote 
the image of M,(H) in H?(H). Let M/,(H) be the group of multipliers m 
for H with m|,4, 4 = m|4.G = 1. From [17, 18] we find the isomorphism 


H2(H) ~ H?(G) x H'(G.A*). 


More precisely, any element in M,(#) is equivalent to one in M(H). If m € 
M',(A), mo = migxeg, and 6,,(g—')(a’) = migra’), them, © Z'(GwA*), 
and m +> (1m, §,,) iS an isomorphism M‘,(H) = Z*(G) x Z'(G, A*) which 
is well defined in cohomology and gives the above isomorphism. It follows 
from this that ifn € Z(G) and 6 € Z'(G, A*) are given then one may 
define m € M)(H) by m(ag.a’g') = n(g. 2')0(g7')(a’). If mn = 1 then 
m(ag,a’e') = O(g7')(a'). If H'(G. A*) = 0 every multiplier of His 
equivalent to the lift to H of a multiplier for G. 

As I mentioned above, abelian groups in general have nontrivial multiplier 
groups. If is a multiplier for A and g € G, then 


A 


US sabe > u(gag!, gbg') 
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is again a multiplier for A, and the map jz +> pS induces a map 
H?(A) —> H?(A). 


AS g varies we obtain an action of G on H*(A). Suppose now that U is a 
PUR of AH with multiplier m and let m4 be the restriction of m to A x A. 
Then, for fixed g € G, ma is once again the multiplier of the PUR a +> 
U(g)U(a)U(g)~! of A. On the other hand, for some constant c(a) with 
lc(a)| = 1 we have 


U(g)U(a)U(g)"' = c(a)U(gag"'!) (ae A) 
and so the representations 
at U(g)U(a)U(g)7! and ato U(gag ') (a € A) 


are projectively the same. Since m%, is the multiplier of the latter PUR, it 
follows that m4 and m‘, are equivalent. Hence the corresponding element in 
cohomology, the class [7 ,] of m4, is fixed by g. Thus [17,4] is G-invariant. 
If we know that G does not fix any nonzero element of H*(A), then m, is 
trivial, and we are in the framework of Mackey’s formula discussed above. 

Let us write A*(A) for the group of alternating bicharacters of A, namely 
maps f(A x A —> T which are characters in each variable separately, and 
satisfy f(a, b) f(b. a) = 1. Itis then easy to check that such f are multipliers 
for A and we have a map A*(A) —> H?(A). If we now assume that A is 
2-regular, 1.e., the map a +— 2a is an automorphism of A, it is then shown 
in [20] that the above map is an isomorphism, i.e., 


A?(A) ~ H’(A). 


The conditions 
A is 2-regular, A)” 0) 


then guarantee that all multipliers of H are trivial on A. 

For instance, if V is a finite dimensional vector space over R or Q.,, then V 
is 2-regular and A*(V) is naturally isomorphic to the space of skew symmetric 
bilinear forms on V x V, as was shown in [17]. If b is a skew symmetric 
bilinear form on V x V, the map m, : x, y +-> e’?°-” is a multiplier for V, 
and b +> m, induces an isomorphism A7(V) ~ H°(V). If we have a PUR 
of H where A = V, then the bilinear form b has to be invariant under G. If 
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G leaves a symmetric nondegenerate bilinear form invariant and if the action 
of G is irreducible, then the space of invariant bilinear forms is of dimension 
1, and so b must be symmetric, hence 0. In particular, for the Poincaré group 
this means that b — 0 and we are in the framework discussed in Mackey’s 


result. 


Mackey machine for PURs of semidirect products 


If U is a UIR of H, its restriction to A can be described by a projection valued 
measure P on A* by 


U(a)= / x(a)dP(x) (ae A); 
A* 
to say that the spectrum of U is contained in a set E simply means that 
P(A wb) = 0, 


This said, let us continue with H = A x’ G. The Mackey machine for 
ordinary representations operates as follows. We look at the action of G on 
A*. Let O be any orbit for this action. To this orbit one associates a set of 
UIRs of H. We fix a x € O and let G, be the stabilizer of x in G. Then 
to any UIR p of G, is associated a UIR To, of H; the equivalence class of 
To. depends only on that of p, and inequivalent p’s give rise to inequivalent 
T’s. The URs To, have their spectra contained in the orbit O. If we change 
to a different x in O, the set of UIRs of H described by this procedure does 
not change. One can describe this set as precisely the set of all UIRs of H 
whose spectra are contained in O. Clearly, the T’s associated to distinct O’s 
are inequivalent, since their spectra are contained in different, hence disjoint, 
orbits. As we vary O we thus obtain a disjoint family of subsets of the unitary 
dual of H. 

Under a regularity condition on the action of G on A* this procedure, called 
the Mackey machine, gives all of the unitary dual of H. The condition is that 
the action of G on A* is Borel smooth, i.e., there is a Borel set in A* which 
meets each orbit of G exactly once. What happens is that in the presence of 
regularity, the spectrum of any UIR of H is necessarily contained in some 
orbit. It is a famous and beautiful theorem of Effros [21] that the action is 
regular if and only of each orbit is locally closed, i.e., open in its closure. Of 
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course the action need not always be regular. Non regular actions are often 
called ergodic. In the ergodic case there are UIRs of H whose spectra are not 
contained in any orbit, and so these cannot obtained by the Mackey machine 
[9]. The complete determination of the unitary dual of H in such cases is still 
elusive, as far as I know. 

It turns out [19] that there is a remarkable variant of the Mackey machine 
for dealing with PUIRs of H. In order to describe this we shall now introduce 
certain new actions of G on A* defined by cocycles in Z(G. A*). If6 : G > 
A* is acontinuous map with 6(1) = 0, then, defining g{x} = glx] + 4(g), 
for g € G, x € A’, it is easy to see that g : x +> g{x} defines an action of 
G on A* if and only if 6 € Z'(G, A*). This action depends on the choice of 
the cocycle 9 € Z'(G. A*), so we write it as g{x}. The actions defined by 
@ and 6’ are equivalent in the following sense: if 0’(g) = 6(g) + g[E] —& 
where & € A*, then gy, = t~!' 0 g, ot where T is the translation by & in A*. 
The action gy : x +> ge{x} is called the affine action of G on A* determined 
by 6. 

We now obtain the basic theorem of irreducible m-representations of H, 
which is a variant of the Mackey machine. 


Theorem. Fix x € A*, m = (mo,6). Then there is a natural bijec- 
tion between irreducible m-representations of H = A x' G whose spectra 
are C G{x} (the orbit of x under the affine action) and irreducible mo- 
representations of G,, the stabilizer of x in G for the affine action. If the 
affine action is regular, every irreducible m-representation of H, up to unitary 
equivalence, is obtained by this procedure. 


Corollary. Suppose H'(G, A*) = 0. Then we can take 6(g) = 1 and 
m(ag, a’g’) = mo(g, g’). Inthis case, the affine action reduces to the ordinary 


action. 


Remark. It follows easily from the relationship between the affine actions 
defined by two equivalent cocycles 4, 4’ described earlier that the classes of 
PURs defined by 6 and 6’ are equivalent. 


6.3 Elementary particles over nonarchimedean fields 


From our point of view the simplest situation arises when spacetime is a finite 
dimensional, isotropic quadratic vector space over a nonarchimedean local 
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field k of ch # 2. We shall actually assume that k = Q,. Let G = SO(V) be 
the group of k-points of the corresponding orthogonal group preserving the 
quadratic form. By the k-Poincaré group we shall mean the group 


Pp =V x'G. 


It is the group of k-points of the corresponding algebraic group which is 
defined over k. Note that when k = R this means that we do not get the 
connected Lorentz group but the group where only the condition that the 
determinant is 1 is imposed, not the orthochronocity. The requirement of 
Minkowki signature does not make sense over k. Instead we fix the Wirt 
class. The results below do not depend on the Witt class. We write V’ for the 
algebraic dual of V. With its k-topology it becomes isomorphic with V™ as a 
GL(V )-module. 

We have already seen that all PURs of the Poincar’e group have trivial 
multipliers when restricted to the translation subgroup V. The main point 
here therefore is that the cohomology H!(G, V’) = 0, as was proved in [17]. 
Hence the PUIR’s of P, can be obtained by the above theorems. They are 
classified by the orbits of Gin V’. The orbits are: the level sets of the quadratic 
form when the value of the quadratic form (called the mass) is nonzero; the 
level set of zero with the origin deleted; and the singleton consisting of 0. 
These are referred to as massive, massless, and trivial massless respectively. 
The orbit action is regular by the theorem of Effros since all orbits are either 
closed or open in their closure. All the orbits have invariant measures on 
them. 

This classification shows that the elementary particles over Q,, have a richer 
structure than in the real case. The PUIR’s are first classified by mass, and 
for a given mass, by the PUIR’s of the stabilizer in G of a point in that mass 
orbit. Unlike the real case we cannot replace the PUIRs of the little groups by 
URs of a single TCE of these groups. The determination of all the multipliers 
of the little groups is still not completely done. 


Study of H'(G, U) for p-adic Lie groups G 


So everything boils down to the vanishing of H'(G, V’). This is not com- 
pletely trivial and so let me try to explain it in some detail. For various 
technical aspects of p-adic Lie groups and 1-cocycles see [22]. We look at 
H'(G. U) where G is a p-adic Lie group and U a finite dimensional vec- 
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tor space over Q,, on which G acts linearly. The cocycles are morphisms 
of p-adic Lie groups and so are analytic and one can differentiate. For any 
X € g = Lie(G) let 0(X) be the left invariant vector field on G differentiat- 
ing by the rule (0(X)g)(x) = (d/dt),-og(x exp?X) for x € G and analytic 
functions g on open sets of G containing x. Then we have a linear map 


X+—> (0f)(X) = (A(X) f) (1). 


If we replace f by f + f’ where f(g) = g-u — u for some u € U, then 
Of'(X) = df (X) + Xu showing that the cohomology class of df does not 
change. So f +> df induces a map 


den (G0) ear (ae). 


For any cocycle f we write [ f] for its cohomology class. Notice that as the 
group cocycles are U-valued, they become a vector space over Q,, and so 
H'(G, U) is aQ,,-vector space and @ is a Q.,-linear map. 

In general the map H '(G, U) —> H'(g, U) has no reason to be surjective 
because a Lie algebra morphism g —> U 6’ g lifts only to a morphism of an 
open compact subgroup of U x G, so that the Lie algebra cocycle can only 
be exponentiated to a cocyle on a compact open subgroup K of G. Moreover 
one needs the cocycle only on a small neighborhood of 1 in G to get the Lie 
algebra cocycle. This makes it clear that from the Lie algebra we only get 
the inductive limit limx H'(K, U) where K runs through the (diminishing) 
compact open subgroups of G. For these questions, not only for H ' but for all 
H’ see Lazard [23a] and Casselman-Wigner [23b]. We proceed in a simpler 
way, since we are dealing only with H'. 

The kernel T° of the map 


Gees Hea 0) 


can easily be computed and it is the set of classes which contain a cocycle 
vanishing on a compact open subgroup K of G, hence constant on the cosets 
eK. If f is such a cocycle, it follows that f(x) is fixed by KM coke! . TE 
we now make the technical assumption that the stabilizers of elements of U 
are eiher of measure 0 in G or all of G, we may conclude that f takes values 
in U®, and hence that 

| r ~ Homo(G, U®) 
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where the suffix 0 means morphisms that vanish on some compact open 
subgroup of G. If this Hom space is 0 and H'(g. U) = 0, then iiGany = 0 
will follow. The technical assumption, seemingly bizarre, is quite natural. If 
we assume that the action of G on U is algebraic and defined over k, the 
stabilizers are algebraic groups defined over k. If G is connected the k-points 
are then either of measure 0 or the whole of G. 

If G is semisimple, the Lie algebra cohomology is 0 as we are in character- 
istic 0. Hence it is the vanishing of the Hom space that needs to be established. 
There are several circumstances in which this can be proved. For instance, if 
G is generated by compact subgroups. Indeed, if K is a compact subgroup 
and f € Homo(G. U%), then f(K) is a finite subgroup of Q,, and so is 0, 
so that the kernel of f contains all compact subgroups and so is G. Another 
condition is if the commutator subgroup of G is of finite index in G; for then 
f(G) itself is a finite subgroup of Q,, and so is 0. If G is semisimple this 
condition is satisfied. for full details see [17]. 


Galilean group and Galilean particles 


Over the reals we have a rather full treatment of the Galilean symmetry and 
the associated Galilean particles [12, 13]. I shall now discuss how Galilean 
symmetry can be treated over Q,,. This example is interesting since we have 
nonzero cohomology. Galilean spacetime V = k’*! has the decomposition 
into space and time: V = Vo ® V; where Vp = k”. V, = k. The Galilean 
group is the semi direct product G = V x’ R where R itself is the semi direct 
product of rotations and boosts. Thus Vo is a quadratic vector space. We set 
Ro = SO(Vo) at first and set R = Vo x’ Ro. The action of G is defined by 


r= ((u,n), (v, W)):@,t) > (Wx +tvtu,tt+n). 


We write (-. -) for the bilinear form on Vy. The dual V’ consists of pairs (€. 1) 
with duality ((&, 1). (wu, n)) = (€. uw) + tn. The actions of the group R on V 
and V’ are given by 


(v, W): (4, n) +> (Wun, n), (v, W): (€,t) > (WE, t-(WE, v)). 


Let 


6,(v, W) = (2tv, —T(y, v)) (t Ek, (v, W) € R). 
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we 6, are in Z'(R.V’) and t /> [9,] 1S an isomorphism of k with 
H'(R.V’). Let no be a multiplier for Ro and let n be the lift to G of no 
via the composition of the maps G —> R and R —> R». Define Far DY 


Mnyc(r, 1’) = no((v, W), (v’, W’))(—2t(v, Wu’) — tn'(v, v)) 


for r = ((u,n).(v, W)),r’ = ((u’. 1’). (v'. W’)) € G. Then it follows from 
[17] that 


(np, T) > nm,,,.7 


gives an isomorphism of H?(Ro) x k with H?(G). This extends to the nonar- 
chimedean setting the results of Bargmann [12] and Cassinelli et al [13]. 

From this we can determine the Galilean particles. The analysis is some- 
what involved and we just give the highlights [19]. We treat only the case 
t #0. Fix nowt #0. The affine action corresponding to the cocycle 6, is 
given by 


(v, W): (&,t) > (WE 4+ 2tv, t — (WE, v) — T(v, v)). 
It is an easy calculation that the function 
M : (&,t) > (€,€) + 4tt 


is invariant under the affine action and maps onto k, since M((0, a/4t)) = 
a. If M((&.t)) = a the element (€/2t, J) of R sends (0. a/4r) to (&, fr). 
Hence the sets M,, where M takes the value a are orbits for the affine action. 
The stabilizer in R of (0. a/4t) is just Ro. Hence for a given a the m,,,.r- 
representations are parametrized by the no-representations of Ro. 

However a little more analysis reveals that for different a all these repre- 
sentations are projectively the same. The projection map 


(Cea s 


is a bijection of the orbit M, with Vj. The action of R on M., becomes the 


action 
Et > WE + 2tv 


under this bijection and so Lebesgue measure is invariant. The parameter 
a has disappeared in the action. In the Hilbert space of the corresponding 
representation, the spacetime translation (u. 17) acts as multiplication by 


170 6 Nonarchimedean physics 


v(uo+ moe) 


na 
Wi) 
pulls out and is independent of the variable €. Hence it is a phase factor and 
can be omitted. The resulting projective representation is thus independent 
of a. Hence all these representations represent a single particle. The true pa- 


rameters are t(# 0) and the projective representations jz of Ro. We interpret 
t as the Schrédinger mass, and wu as the spin [19]. 


The factor 


Conformal symmetry 


It was discovered long ago [24a, 24b] that Maxwell’s equations are invariant 
under conformal transformations of spacetime. In recent years this question 
has been treated from the point of view of representation theory. The starting 
point of this view is the fact that the Poincaré group P is imbedded in the con- 
formal group C which is SO(2, 4). If now U is an UIR of P, it makes sense 
to ask if U has an extension to C as a UR and if so, is the extension unique? 
Such an extension will obviously be a UIR of C. The conformal group in- 
cludes dilations and so for an extension to exist, the particle represented by U 
has to be massless. It was shown by many people (see the references in [25]) 
that the UIRs representing the massless particles with finite helicity extend 
as URs of the conformal group. The question makes sense and is interesting 
in all dimensions. Over the reals the Poincaré group P,,,; of the Minkowski 
space R'” can be imbedded in the conformal group Cy.) = SO(2,n + 1) 
in such a way that the spacetime R'-” is dense and open in a compact homo- 
geneous space for the conformal group, the so-called conformal spacetime. 
In [25] Angelopoulos and Laoues settled this question for all n. The the- 
ory of the conformal group goes back to Dirac who was deeply interested 
in electrodynamics, both classical and quantum, in conformal space; it has 
been continued in modern times by Angelopoulos, Flato, Fronsdal, Ferrara, 
Laoues, Sternheimer, and a host of others [25, 26]. 

Itis an interesting question as to how much of these theories can be extended 
Om— © ) We begin with the geometry first. Let V a quadratic vector space 
overk, V =k @V, and (resp. P;) the k-Poincargroup of V (resp. the 
k-Poincaré group of V). Let Vy) = V @U where U is a quadratic vector 
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space with a basis u,v such that (uw. uw) = (v, v) = 0, (u, v) = 1. We define 
Vo =k @Vo,U =k QU. Then C = SO(W) is an algebraic group defined 
over k. We write C(k) for its group of k-points. 


Then the group Pj is isomorphic, as an algebraic group over k to the 
stabilizer C,, of u in C. The isomorphism is defined over k and gives an 
isomorphism of P, with C,,(k), the stabilizer of w in C(k). Writing Vo as 
ku ® kv @ V the imbedding is given (in block matrix form) by 


1-2 e(t, R) 
(t, R)rR> 10 1 0 
0 f¢ R 


where e(t, R) € Hom(V, ku) is the map v +> (tf, Rv)u and R € SO(V). 

Let £2 be the cone of null vectors in Vo and [Q] its image in projective space. 
Let A, = {a € Q|(u,a) 4 0}. Thena = au + Bu + w, where w € V, and 
B #0. Taking B = | does not change the image [a] of a in projective space, 
and then a = —(w, w)/2 so that [a] is given by [—(w, w)/2 : 1 : w]. Thus 
[a] is entirely determined by w. Thus J : w th [—(w.w)/2:1: wlisa 
bijection of V with the image [A,,] of A, in projective space. Then one can 
show the following. There is a natural conformal structure on [G2], and the 
group C(k) acts transitively on [8&2]. Moreover [A,,] is a Zariski open dense 
subset of [Q] stable under C,,, and the imbedding J intertwines the action of 
the Poincaré group P, with that of C,, on [2] 


Fork = Q,,, a local field, [8&2] is compact and so we have a compactification 
of space-time into [2]. For this reason it is natural to call [2] the conformal 
space-time over k. The conformal group in general will move points of 
spacetime into the infinite part [Q] \ Aj,);. It is only the Poincaré group 
extended by the dilatations that will leave spacetime invariant. 

We shall now say that a Q.,-particle has conformal symmetry if the cor- 
responding PUIR extends to the conformal group. It is not difficult to show 
that massive particles do not have conformal symmetry. However I do not 
know if massless particles have conformal symmetry. This question, and 
more generally, the relation between UIRs of the conformal group and the 
Poincaré group, as well as possible analogs of the Dirac representations and 
the AdS geometries to the p-adic case remain to be investigated inasystematic 


manner. 
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Nonrelativistic quantum mechanics 


I would also like to mention a hybrid theory which is really nonrelativistic 
quantum mechanics in the usual mode, but the kinematical and dynamical 
objects have p-adic manifolds as sources. For instance, one can have a 
conventional model where the dynamical Hamiltonian is an operator on the 
L? of a p-adic manifold. The absence of differential operators is not an issue 
since we can use Fourier transfom theory to define and work with pseudo- 
differential operators. One interesting question is the construction of path 
integral representations for the propagators. The scalar theory was begun 
in [27a] and the theory for systems with internal degrees of freedom was 
constructed in [27b]. Unlike the theory over R where the free hamiltonian 
is the Laplacian and the path measure is the Wiener measure on the space of 
continuous paths, in the p-adic case the measure is on the Skorohod space 
of right continuous paths with only jump discontinuities; it is not a Gaussian 
measure but the measure associated to a stochastic process with independent 
increments. 

Let K be a nonarchimedean local field and W a finite dimensional vector 
space over K. We choose a nontrivial additive character on K and define 
Fourier transforms on functions on W with complex values. Let F be the 
Fourier transform operator, defined on L7(W). We equip W with an ultra 
metric norm | - | and define the free hamiltonian as A, := Fo|- |? oF~! where 
| - | is multiplication by | - |°. The full hamiltonian is then H = A, + V 
where V is the multiplication by a potential function, also denoted by V. In 
the vector case the Hilbert space is L?(W)" and V is ann x n matrix valued 
function. If we work with R and take b = 2 with V having a euclidean norm, 
then A, = —A, is the usual Laplacian. Thus the free hamiltonian in the 
p-adic case is a generalization of the Laplacian. There is no need to restrict 
feithe case p= 2. 

It turns out that the function exp(—r|é |”) (t > 0) is the Fourier transform 
of a probability density function f,, on W, and the family (f,,);+09 defines 
a continuous convolution semigroup of probability densities which goes to 
the Dirac delta measure at the origin as t — 0. Hence for any starting 
point x € V we can associate a W-valued separable stochastic process with 
independent increments, X(t), with X (0) = x, such that for any t > 0.u > 0, 
X(t +u) — X (uw) has the density f,,. It can be shown by standard arguments 
that go back to Censov in 1956 that the process X(t) defines a probability 
measure P‘ in the space D of right continuous paths on {t > 0} with values in 
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W having only jump discontinuities; the space D is the one first introduced by 
Skorohod, provided with a topology that is acomplete separable metric. More 
generally, for given x, y € W there is a measure P! . on D corresponding 
to the process X(t) as above, but now conditioned by the requirement that 
X(0) = x, X(t) = y. The conditioned measure P! .. is constructed in the 
same way as P’, using the conditional densities of the joint probability laws 
at time points f), fo,..., ty. 

In [27a] only the scalar case was treated and the potential V was assumed 
to be bounded also from above, so that there was no question of the essential 
self-adjointness of the Hamiltonian H = A, + V. In [27b] the vector case 
was fully treated, V now being a function with values as self-adjoint n x n 
matrices. If V is locally bounded, it can be shown that H is essentially 
self-adjoint on the space of Schwartz—Bruhat functions; the proof in [27b] is 
essentially that of Kochubei. 

The hamiltonian H is now well defined and > 0. So the operator semigroup 
e~= (§(z) => 0) is well defined and one can get the quantum propagator 
e''#! by analytic continuation from the positive semigroup e~'” (t > 0). 
The Feynman-—Kac path-integral formula describes the operator e~' 
integral operator (propagator) with kernel K,(x, y) where 


A as an 


K,(x, y) = [ exo(- | V(o(s))ds Ja P} (0). fualt —¥ 
D 0 


In the vector case one has to be more careful. The integrand in the above 
formula has to be modified, with a time-ordered exponential in place of the 
ordinary exponential. The simplest way to proceed is to define, for any a in 
the Skorohod space of paths with values inn x n matrices, the time-ordered 
exponential E(a) by 


< E(a(t)) = E(a(t))a(t), (E(a(0)) = I. 


Then the propagator for e~'” is an integral operator with kernel K,(x, y) 


where 


K, (x,y) = ik exp ( = i E(—V o w(s)ids Ja? (0) ape y). 
D 0 


For details and references to the works of Censov and Kochubei mentioned 
above, see [27a, 27b]. 
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Supersymmetry 


Finally | would like to mention that the extension of supersymmetry to the 
p-adic setting is an interesting open question. Supersymmetry over R is 
well understood [28a]. For URs of super Lie groups over R we now have 
a definitive theory [28b]. Its extension to the nonarchimedean setting is an 
open challenge. It appears likely that we may have to study representations 


in p-adic Banach spaces. 


6.4 Nonarchimedean quantum field theory 


It is consistent with the philosophy expressed in the preceding remarks to 
ask if one can construct nonarchimedeal analogues of infinite dimensional 
quantum systems such as quantum fields or scattering theory. This is a very 
interesting open area and some fundamental results are just being obtained. 
First of all, some remarks on usual quantum field theory are in order before 
we get to the more exotic nonarchimedean case. Historically, the beginnings 
of the theory of quantized fields goes back to the work of Dirac on radiation 
theory and of Heisenberg and Pauli who attempted to do canonical quantiza- 
tion of fields in imitation of the standard version of matrix mechanics. After 
the second world war, when new experiments opened up the problems of 
theoretical explanations of these results, physicists developed the theory of 
quantum electrodynamics (QED). In spite of great mathematical difficulties a 
consistent approach to QED was developed by the heroic efforts of Feynman, 
Schwinger, and Tomanaga, based on renormalization, a technique whose full 
structure was revealed by the work of Dyson. From then on renormalization 
was accepted as an essential component of reality, and eventually the stan- 
dard model was constructed, encompassing all known results of elementary 
particles. The quantized field became the fundamental paradigm of modern 
physics, with the elementary particles arising as the quanta of excited fields 
[8]. Many people, including Dirac, however had difficulty in accepting many 
aspects of this development such as renormalization completely, and began 
searching for more mathematically acceptable ways of reaching the princi- 
pal results of this developments. This program has had only a very limited 
success, but nevertheless it has revealed the depth of the questions, and un- 
earthed mathematical problems of great interest, irrespective of their physical 
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significance, not to mention the foundations it provided for the Spin-statistics 
connection and the PCT invariance. 

One approach that has been pursued to give a better understanding of 
quantum fields is that of the so-called vacuum expectation values 


EN Abus iN) =e)... OCF y)&2, 02) 


where 2 is the vacuum state, and ®(f;) are the field observables, the ii 
being test functions on Minkowski spacetime M (either of compact support or 
rapidly decreasing). The idea of field observables associated to test functions 
goes back to Bohr and Rosenfeld who showed by decisive arguments that 
the quantum field strengths fluctuate too wildly so that only their averages in 
a small region, determined by test bodies, can be regarded as observable in 
the quantum regime. In his pioneering studies, Wightman essentially defined 
quantum field theory as the theory of these expectation values Ey, which 
are distributions on the N-fold product M™ of the spacetime M. The Ey 
are essentially determined by a sequence of Lorentz invariant distributions 
Wy on M*% which are really boundary values of analytic functions defined 
on a tube domain over M‘. The symmetry properties of the Ey are rather 
complicated because in general one can interchange f; and f; only when their 
supports are space-like separated, i.e., when light signals emanating from 
their supports do not meet. It was Schwinger who took the great and profound 
step of going over to Euclidean spacetime (sometimes also called Schwinger 
spacetime) E, where time has been made pure imaginary, and looking for 
the vacuum expectation values on €”. He thus created Euclidean quantum 
field theory. Schwinger’s idea was to construct the Minkowski spacetime 
expectation values by analytic continuation from the Euclidean values. The 
Euclidean group (at least its nontranslatory part) is compact, thus allowing 
(in his words) this idea to be viewed as an extension of Weyl’s unitarian 
trick to infinite dimensional quantum systems. The Euclidean expectation 
values actually turn out to be functions, the so-called Schwinger functions, on 
E. Heuristically, the Schwinger functions are the moments of a probability 
measure defined on the space of distributions (or tempered distributions) on 
the Euclidean spacetime. Thus the construction of quantum fields has been 
reduced to the construction of certain probability measures on Euclidean 
spacetime [29a]. For free fields the measure is Gaussian and so one wants 
non-Gaussian measures to capture the essence of interacting fields. 
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In their fundamental paper [29b] A. N. Kochubei and M. R. Sait-Ametov 
have constructed such non-Gaussian measures on the space D(Q’,) of distri- 
butions on Q’,. The measures are absolutely continuous with respect to the 
free field Gaussian measure and live on the spaces of distributions with a fixed 
compact support (finite volume case). Eventually one has to eliminate this 
restriction and determine the so-called infinite volume limit. The methods 
used are variations of the methods used in the corresponding real case. They 
are quite beautiful but technical. 

One has to do these problems in all characteristics and over arbitrary local 
fields as ground fields, perhaps even on very general manifolds. The corre- 
sponding scattering theory has been largely untouched. That there must be a 
serious scattering theory in the nonarchimedean case is clear from the work of 
Harish-Chandra on the Plancherel formula for reductive p-adic groups, where 
the Plancherel measure is constructed by a process extremely reminiscent of 
scattering matrix theory. 
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Chapter 7 


Differential equations with irregular 
singularities* 


Why should an algebraic geometer be interested in 
irregular connections? 
Pierre Deligne 


The first global studies of differential equations with rational coefficients are those of 
Riemann on the hypergeometric equations. These are special cases of Fuchsian equations, 
or, equations with regular singularities. Their theory is essentially controlled by the 
monodromy action. The equations with irregular singularities tell a completely different 
story. Here the central fact is that formal solutions do not always converge. Their theory 
goes back to Fabry in 1885 who discovered the phenomenon of ramification, and the 
decisive developments came from Hukuhara, Levelt, Turrittin, and others. In more recent 
times, the ideas of Balser, Deligne, Malgrange, Ramis, Subiya, Babbitt and myself, and a 
host of others, have created a more modern view of irregular linear differential equations 
that relates them to themes in commutative algebra, linear algebraic groups, and algebraic 


geometry. 


7.1. Introduction 

7.2. Reduction theory 

7.3. Formal reduction at an irregular singularity and the theory of linear 
algebraic groups 

7.4. Stokes phenomenon 

7.5. The Stokes sheaf and the scheme structure on H'!(St) 

7.6. Reduction theory for families 


* This essay owes a great deal to discussions with J.-P. Ramis and Y. Sibuya, to Pierre 
Deligne through many conversations and letters, and above all, to my long collaboration 
with Don Babbitt. I am deeply grateful to these and many others for their patience and 


generosity. 


Ee 
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7.1 Introduction 


Linear differential equations of the n' order on a domain & in the complex 
plane have been the objects of intense attention from classical times. The 
most famous of these is the hypergeometric equation 


2 


d* ae 
z2(z— | a+b+1)z—c|—+abF =0 
(cores, + | ye — Cle 
which was studied by Gauss. It has a solution analytic at < = 0 given by 
(a, n)(b, n) 
= 2" (|z| < ]). 
Hig ea yo ane (Iz| < 1) 
n>0 
Here a, b. c are complex parameters, c ¢ {O. —1, —2....}, and, in the usual 
notation, Cains 
ial 
(Get) eco eae Oe see jp 
Ie) 


I being the classical gamma function. Dividing by <(< — 1) to make the 
leading coefficient | we see that this equation has 0 and | as singular points. 
Moreover going over to = oo by the substitution z = t~' we see that z = oc 
is also a singular point. This is an example of a general linear differential 
equation on P! = CU{oo} with rational coefficients whose poles are contained 
ina finite set D containing oo. On any simply connected subdomain of P! \ D 
one has a basis of n linearly independent solutions. It follows from this that 
if we start with a solution at a point x9 € Q, it can be analytically continued 
along any path starting and ending at x9, and not meeting D, but the final 
solution may well be different from the initial one, although it depends only 
on the homotopy class of the path. We thus have an action of the fundamental 
group of P! \ D at xp on the space of solutions at xp. This is the monodromy 
representation of the given differential equation. 

In a famous paper [1] written in 1857 Riemann undertook for the first 
time the study of the global nature of the solutions for the hypergeometric 
equation. He recognized the importance of the monodromy representation 
for this question and computed it explicitly. The fundamental group of P, \ 
{0, 1, co} = C\{0. 1} 1s the free group on two generators Ay. A;, the loops in C 
around 0, | respectively. Riemann determined the 2 x 2 matrices representing 
the loops Ay. 4;. He restricted himself only to equations whose solutions 
around a singular point (taken to be z = 0) belong to the module (over 
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the ring of functions holomorphic at z = 0) generated by the multi-valued 
functions z“(a € C). The numbers a are the so-called local exponents and 
there are two of them at each of the singular points 0, 1, oo; the restriction 
of Riemann amounts to saying that at each singular point the local exponents 
do not differ by an integer. Riemann knew that a second order equation 
with singularities at s. f, u is completely determined by the knowledge of the 
local exponents of its solutions and introduced the famous notation for the 
multi-valued function which has the given local exponents: 


SL ae 
PG 8 yee 
a’ Bp’ y 
For example 


Oe GO | 
ENG. D 76 22) =P OF a 0 cg 
l1—-c bc-—a-—b 


After Riemann it was Fuchs who took up this theme. He considered the 
class of differential equations on P! which have rational coefficients. A 
singular point of such an equation is called regular if the solutions do not 
have an essential singularity, or, assuming the singular point to be z = 0, the 
solutions have at most polynomial growth: on any open sector with vertex at 
z = 0, the solutions are O(|z|~”) for some integer N > 0 as z > 0. This 
formulation is necessary because the solutions are in general multi-valued and 
become single-valued only on simply connected domains, such as a sector. 
Fuchs considered the equations all of whose singularities are regular; they 
are called Fuchsian nowadays. He discovered the criterion for a differential 
equation with rational coefficients to be Fuchsian and developed their theory. 
Eventually this theory was put in definitive form by Frobenius, which is the 
way we learn the subject nowadays. 

That Fuchsian equations arise from algebraic geometry is a very deep fact, 
the first inkling of which goes back to Legendre and is encoded in his formula 


2 


[ dé 
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= F(1/2, 1/2. 1: k*). 


And, for 
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: dx 
K(k) =) a ————— 
6 (1 — x2)0 — k7x2) 


we have the Legendre equation 
d°K dK 
1 — k’)—~ + (1 — 3k?) —— — kK =0. 
k( ) Te a ) Tk 


The geometric meanon of this equation is that the periods of elliptic integrals 
satisfy a suitable equation of the hypergeometric type. It was Fuchs who saw 
that this was not a special happenstance but a general fact, and succeeded 
in proving that the period integrals of families of Riemann surfaces satisfy 
regular singular differential equations, thereby inaugurating one of the most 
profound aspects of our subject, namely the link between meromorphic dif- 
ferential equations and complex algebraic geometry through period integrals. 
Classical analysts also considered differential equations with doubly periodic 
coefficients (Lamé’s equation) so that already Riemann surfaces other than P ! 
were beginning to enter the picture as the manifolds on which the differential 
equations were defined. The work of Riemann and his successors such as 
Fuchs, Schwarz, Klein, Poincaré, Pochhammer and others showed that the 
theory of fuchsian equations was a beautiful structure [2]. 
Differential equations of the n” order are equivalent to first order systems 
du 
— — Au = 0, 


rg 
4 


satisfied by n x 1-vector functions u, A being ann x n matrix of functions. 
If we start with the differential equation 


a" Veh a,3d" VEE: - - ava 0 hime 


the transformation 


leads us to the above system where 
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Oso 0 

CI Oi | 0 
A= 

=—do —dj...—d, 1 1 


This equivalence is not canonical: for instance we could have chosen 


ul 


zdu 


anal qualy 


The trivial complex vector bundle on Q has a connection defined on it with 
the covariant differentiation 


d 
Vid/de = ee AG 


The horizontal sections u of this connection are the solutions of the above 
differential equation and so provide a geometric meaning to the solutions. 

The concept of Fuchsian equations can be formulated in the context of sev- 
eral complex variables also. Hypergeometric equations in several variables 
were considered by Appell and Picard among others. The general theory of 
regular singular equations in several variables was first treated by Deligne in 
an extraordinarily influential monograph*. In some sense the theory is alge- 
braic geometric. For aspects of the theory in several variables see [4a, 4b]. 

Nevertheless equations which are not regular singular also arise naturally. 
This is typically the case when we consider a family of regular singular equa- 
tions where the singularities coincide in the limit with the local data at the 
singularities changing suitably, or, as it is usual to say, become confluent. 
The Bessel equation, the Whittaker equation, the confluent hypergeometric 
equation, the Airy equation, and many others are examples of differential 
equations with rational coefficients but with irregular singularities. For in- 
stance, we start with the Riemann P-function 


0 Comrc 
P 1/2 Sec — kz 
1/2—m 0 k 


and let c — oo to get the equation 
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d-u i du = € n aaa) —, 
dz az a 


~ 


which has 0 and oo as singularities, with 0 regular and oo irregular. If we set 
u = e~'/*= W, the equation for W becomes the Whittaker equation 


d?Ww 1 k ea) 
sou gece eb pe! SS 4h) 
dz? +( 7 2? 


Such equations may also be considered on any compact Riemann surface. 
The equations with irregular singularities give rise to new phenomena, for 
example, formal solutions seldom converge and solutions typically do not 
have polynomial growth at a singular point. The general problem is to build 
a theory of solutions to such equations. Even though the solutions to such 
equations are very often highly transcendental, classical analysts discovered 
many equations that fall under this umbrella which were very important in 
applications. So analysts always have a soft spot for such functions, and it 
is natural to ask what sort of geometry is described by the solutions of these 
equations. This geometry is certainly not algebraic and its nature is still very 
much a mystery. The rhetorical quote at the top of this essay thus makes 
sense. 

Starting from the 1970s, anumber of people have been interested in looking 
at differential equations with irregular singularities. Many letters have been 
exchanged among some of them. Some of the letters involving Deligne, 
Malgrange, and Ramis have been collected in [5] and they give a good idea of 
the themes involved, the possible questions that may be of interest, and so on. 
Since variations of the periods of complex manifolds leads only to regular 
singular equations (Griffiths, Deligne) it is not clear what algebraic geometric 
significance attaches to irregular singular equations; in [5] this issue is taken 
up in depth. 

In this essay I address some questions regarding irregular singular equa- 
tions which have a group theoretic origin and which have fascinated Babbitt 
and myself very much for a long time. 


7.2 Reduction theory 


Even in the theory of Fuchsian equations a large part of the theory revolves 
around the local study of the equations around the singularities. From now 
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on I shall be concerned almost exclusively with the local theory at a singular 
point which can be taken as z = 0. The central fact about equations is that at 
any regular singularity, formal solutions are always convergent. At an irreg- 
ular singularity this is not the case; typically, formal solutions diverge. The 
classic example goes back at least to Euler who treated the (inhomogeneous) 
differential equation 


whose formal solution is the nowhere convergent series 
D 
z— lz? +2!z?—. 


Examples of homogeneous equations are the Bessel equations which have oo 
as an irregular singularity, and which have divergent expansions there. 

The basic idea is to formulate everything in a group theoretic way. So the 
idea is to try to transform the given differential equation into a simpler one 
which can be presumably solved right away. Of course one has to restrict the 
transformations that one is allowed to use. This idea can be formulated in 
group theoretic terms. If X is the space of differential equations considered 
and G is the group of allowable transformations, then G acts on X; the 
technique is then that of finding in each orbit of G in X an element which 
is especially simple so that the corresponding differential equation may be 
solved without difficulty. This is an example of reduction theory or finding 
canonical forms for the action of G on X, or simply determining G\X in 
some explicit fashion. 

Let us apply this approach to differential equations of the form 

du 

—=Au 

dz 
where u is ann x 1 vector and A is ann x n matrix which is meromorphic 
at z = 0, i.e., has an expansion 


A=A,z + Ariz’ +.. 


If the series has a positive radius of convergence A is actually meromorphic 
in some disk around z = 0, otherwise the series is only formal and we say 
A is formally meromorphic. One can transform the differential equation by 
writing 

Vie 
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where g is an invertible meromorphic matrix; the equation for v then becomes 


1 
aU Ry. peg ee a 
dz dz 
Let 
F = Ci{z}[z7'1, Fo = C[[z]] 
and let 


Pes TI), G = GL@, F), Go = GL(n, Fp): 


Let us denote the convergent elements of F by Feo. Let “A (rep. Ag, mee 
the space of formally meromorphic n x n matrices and G (resp. Geg,) the 
group of invertible elements of G. The above calculation then suggests that 
we consider the action of G on A given by 


dg _, 
a iS . 


SpA BA) = Ae ee 


Reduction theory is the description of suitable canonical elements in the 
orbits of the action of G. The formula for the action is reminiscent of gauge 
actions and so one may think of this as the gauge-adjoint action. Actually 
we have already mentioned that the solutions of the differential equations 
may be thought of as the horizontal sections of a connection whose covariant 
derivative is d/dz — A so that the above action is just the natural action 
on the space of connections. Although the point of view of connections 
on vector bundles and gauge transformations is not essential in the local 
theory, it becomes indispensable in global discussions. The analytic theory 
corresponds to the action of Geg, ON Ager. 

From this point of view the singularity < = O is regular if A can be reduced 
to the form 

Ze C 
where C is a constant matrix, i.e., there is ¢ € GL(. F) such that 
BA] ee. 


If A is actually meromorphic (not just formally) then g is necessarily in 
GL(n. Feo). A fundamental (multi-valued) solution to the equation 
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du 
—_=Au 
dz 


Bs 2(zjelos2e 
from which we see that the monodromy is 


ectic 

The solutions are thus in the classical form of meromorphic series multiplied 
by z*(log z)’. The GL(n. C)-equivalence class of e?”'© then determines com- 
pletely the G.,-equivalence class of A. An important sufficient condition for 
regularity is that A has at most a simple pole at z = 0. 


7.3 Formal reduction at an irregular singularity and the theory 
of linear algebraic groups 


The basic system of equations is 


du 


(1) ae = WA 


If the matrix A does not have a simple pole at z = 0 it is no longer necessarily 
true that the system of differential equations has a regular singularity at z = 0 
and so reduction to the form z~'C can no longer be expected. The local 
reduction theory at an irregular singularity is essentially the creation of many 
people, notably Hukuhara [6], Katz [7], Levelt [8], Turritin [9]. The group 
theoretic approach to reduction theory apparently appeared for the first time 
in {10a] and led to very interesting links with the theory of affine algebraic 
groups and their actions. I shall briefly discuss this aspect of the theory now. 
We start with a matrix A whose entries are in F, the field of formal Laurent 
series in z. Write 
Oyen = A,2 EApa 4... (r < —2, A, € gl(n, C), A, #0) 


~ 


We assume that r < —2 since the system is Fuchsian for r > —I. Let G; be 
the subgroup of Gy of elements = 1 mod z, i.e., elements of the form 
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If P,, are the spectral projections of A, one can find g € G, such that B = 
g[A] commutes with all the P,,. The system of differential equations is 
thus formally decoupled along the spectral subspaces of A, (note that in this 
reduction A, is untouched, namely, B, = A,. If the matrix A, has n distinct 
eigenvalues one can then write down 7 linearly independent formal solutions. 
But if the matrix A, is degenerate this is no longer possible. It was Fabry!! 
who discovered in 1885 that one should go over to a covering plane, namely 
to work with the variable ¢ = <!/” for some integer ) > 2 and use (formal) 
meromorphic changes of the dependent variable to generate formal solutions. 
This means that the reduction group is 


Gu GL, Fp) F, = Clete” 1. 


The system (1) is then said to be ramified. The fractional Laurent series in 
z'/? are the Puisseaux expansions and the fields F;, are Galois extensions of 
F obtained by adjoining <'/”. Of course we know that the algebraic closure 


F of F is the union of these extensions, namely, 


fae ala) i) 


b>] 


Fabry’s idea can now be formulated as the principle that we should work over 
the algebraic closure of F for reduction theory. Clearly this principle is a 
very natural one from the general point of view. 

For example, let 


000 00a 
A={100]z7+ 1/000] z' (a #0). 
010 000 


If we consider the transformation 


z FO 0 
2 = O10 
0 22/3 
then 
00a —1/30 0 


SA se = |.100 eres |) emote ieam 
010 0 01/3 


7.3. Formal reduction at an irregular singularity 189 


The leading coefficient matrix of B has 3 distinct eigenvalues, namely the 
three cube roots of —a. This determines at once the formal fundamental 
matrix as a function of ¢ = z!/3, 

It is of course not obvious how the transformation g was found. Notice 
that conjugation by g operates on each coefficient A,z‘ of A in two opposite 
directions, sending some entries of the matrix to those of a higher degree in 
z and others to ones with a lower degree in z. For this reason the transfor- 
mation g was Classically known as a shearing transformation. In the above 
example, the process stops after one application of a shearing transformation. 
In the general situation one may have to repeat this process several times 
with appropriate shearing transformations at each level. The finiteness of 
the process as well as the method of discovering the sequence of appropriate 
shearing transformations at the various levels are then not at all obvious. 

The reduction procedure is roughly as follows. Since any A can be written 
as A? + f1 where f € F and A® has trace 0, it easily follows that one can 
confine oneself to the reduction of A°. Thus we may assume that A has trace 
0. If the leading coefficient of A has at least two distinct eigenvalues one can 
decouple the system of equations along the spectral subspaces of the leading 
coefficient matrix and so one can use induction on the dimension n. Thus 
one comes down to the case when the leading coefficient matrix is nilpotent. 

Let O be the GL(n, C)-orbit of A, where we ae assuming that A, is nilpo- 
tent. The central step in our method is to construct a gauge transformation 
g € GL(n, F;) for a suitable integer b > | so that the leading term A’, of 
A’ = g[A]| satisfies one of the following: 


(a) A has order > —1 and so is regular 

(b) A’, has two distinct eigenvalues 

(c) A’. is nilpotent but lies in an affine subspace of gl(n, C) that contains 
A, and is transversal to the orbit O. Moreover the orbit of A’. is distinct 
from that of A, and has A, in its closure. 


It can now be proved that the properties described in (c) above imply that for 
the orbit O' of A’, under GL(n, C) we have 


dimc O' > dime O. 


Since there are only finitely many nilpotent orbits this inequality makes it 
clear that the reduction process terminates in a finite number of steps and 
that it will ultimately end with A being in canonical form (see below for the 
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precise definition of a canonical form). The result that the orbital dimension 
of anilpotent matrix increases when it is moved transversally is to be expected 
of course, but its proof is not entirely trivial. It depends on general properties 
of actions of algebraic groups. 

I shall now sketch this argument in more detail. The construction of 
the shearing transformation g above proceeds by first imbedding A, in a 
3-dimensional simple Lie algebra by the use of the well known classical the- 
orem of Jacobson—Morozov according to which we can find elements H and 
X in gl(n, C) such that 


[(H,XJ/=2X, |B. YJja a2 [X,Y |=, Yea. 


A simple argument shows that if we select a subspace a of g = gl(n, C) 
which is complementary to the range [Y, g] of adY, we can find a gauge 
transformation 

g= | + 2ii eel, +e 


such that 
GEA \esedye” WA! age”?! eae, 


+14 


has the property that 
A, sy, A-.9,-+- are all € a, 


One knows from representation theory of s{(2) that gy is a possible choice 
for a, where gx is the centralizer of X in gl(n, C). So we may assume that A 
itself has this property, namely, 


Arai, Ap+2; oe are all Gig. 


Now, by representation theory of 51(2), gy is stable under adH and ad(H ) has 

only nonnegative integral eigenvalues there. So we can find a basis (Zi)i<i<g 

for gy such that = 
Lt, Zl eeZe c; = 0, c € Z. 


The gauge transformation is then of the form z’” where q is asuitable rational 
number. For 
A 2=s/Al=Ae =... 
we have 
ay +Z. Zedy.7 0: 


7.3 Formal reduction at an irregular singularity 19] 


From now on the argument is entirely geometric. It is a question of showing 
that if U € Y + gx is nilpotent and distinct from Y, then the orbit Oy = 
GL(n, C)-U has dimension strictly greater than the dimension of the orbit 
Oy of Y. From the theory of orbits of algebraic groups we know that Oy 
is Zariski open in its Zariski closure Oy and that Oy \ Oy is a union of 
finitely many orbits of dimension < dim(QO,,;). Hence it is enough to prove 
that U ¢ Oy and Y¥ lies in the closure of Oy in the usual topology. If we 
write 
ZC Hie + C2, 7 0, US=Y+Z 

then 


er exp(tH)U exp—(tH) = Y + > OILY fp —Y (t—- —oo) 


l<i<gq 


For any nilpotent U’ one knows that U’ is conjugate to cU’ for any constant 
c % 0; indeed, by Jacobson—Morozov we can write U’ = Y’ for a standard 
sl(2) triple {H’, X’, Y’}, and then 


exp(tH’)U’ exp—(tH) =e 7U'". 
Going back to U which is nilpotent we see that 
, =e exp(tH)U exp—(tH) € Oy, U, > Y(t > —00). 


On the other hand, the affine subspace Y + gy is transversal to the orbit 
Oy. One can show from this that Y + gy does not meet the orbit Oy at 
points close to Y. Since the points U, of the orbit Oy come close to Y for t 
very large and negative, we see that Oy is distinct from Oy. This completes 
the argument. It is not difficult to see that z/” is essentially the “shearing 
transformation” occurring in Turrittin’s work. 

We define a canonical form as a matrix of the form 


B=D 2" 4+---+ Daz™ +z 'C 


where r) < °°* < Im < —1 are rational numbers, C, D,,..., Dm are el- 
ements of gl(n, C), the D; are nonzero and diagonal if m # 0, with the 
convention that B = z~'C if m = 0 (the regular singular case), and C com- 
mutes with all the D;. Ther; are called the canonical levels, r\ 1s the principal 
level or the Katz invariant (as it was first used by Katz in his beautiful study of 
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the arithmetic aspects of connections). The connection obtained by omitting 
the term 2~'C is the irregular part of B; the canonical form is said to be 
unramified if all the r; are integers, otherwise ramified. Then the main result 
is that there is a gauge transformation in <'/” for some integer b > | such that 


gl Ale=-B, B acanonical form 
The canonical form is essentially unique except for the changes 
Des TD TC hee 


One may therefore view the canonical form of A as the complete set of formal 
invariants of A. In particular, the canonical levels are fully gauge invariant. 


Remark 5. This method generalizes for families as I shall show later. The ra- 
tional number g in the gauge transformation 27" can be explicitly determined 
and depends only on the Taylor coefficients A,.,, for0 < m < n(|r| — 1). 
The same is true of the irregular part of the canonical form of A. This ques- 
tion was open classically and was settled only after this new method was 
developed. The bound M = n(|r| — 1) needed for this is sharp. 


Remark 6. The group theoretic method described above suggests that the re- 
duction problem (and others also) may be studied in the more general context 
where GL(n) is replaced by any complex algebraic group G and A belongs 
to the Lie algebra g of G; the groups of gauge transformations are the groups 
of F;,-points of G, namely G(F;,). The action (2a) of G(F;,) on gl(F;,) is still 
well-defined. The canonical forms are defined as before but with the proviso 
that the Bj. C lie ing. The method of reduction described above, based on 
the theory of algebraic groups, goes through with very little change and leads 
to the same results on reduction and estimates for the number of coefficients 
needed to determine the canonical form (or at least its irregular part). For all 
this see [10a, 10b}. 


Remark 7. 1am indebted to Deligne for the remark that the theory over GL (7) 
and the fact that the connections form a tensor category are enough to de- 
duce the results of reduction over any algebraic group. This is the so-called 
Tannakian approach to these problems. 
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7.4 Analytic reduction theory: Stokes phenomenon 


I have already remarked on the fact that at an irregular singularity, formal 
solutions do not always converge, and it is not clear what meaning can be 
attached to the formal solutions. In particular, the entire reduction theory 
outlined above hangs in the air without any geometric or analytic significance. 
The meaning of the formal solutions was discovered by Poincaré in 1896 
when he proved that the formal solutions are actually asymptotic expansions 
of analytic solutions at the singularity. Originally such an asymptotic result 
was proved for solutions of linear differential equations, but eventually it 
was proved in a very general nonlinear context [12a]. For a more recent 
exposition, see [12b]. It follows from this that given a formal solution s of 
a linear meromorphic differential equation with an irregular singularity at 
< = 0, one can find, on any sector with vertex at z = 0 and angle sufficiently 
small, an analytic solution which is asymptotic to s on that sector. If we have 
a meromorphic connection matrix A with a formal canonical form B (which 
we assume to be unramified for simplicity, i-e., have integer canonical levels) 
such that for some gauge transformation g € GL(n, F) we have g{A] = B, 
then on sectors with small angles we can find an analytic gauge transformation 
uw such that uw is asymptotic to g and u[A] = B. It should be mentioned that 
to reach this result the nonlinear form of the asymptotic theorem is essential. 

Consider now a connection matrix formally equivalent to a canonical form 
B. The formal reduction is effected by a gauge transformation g which 
is typically divergent. In the analytic theory one constructs analytic gauge 
transformations that are asymptotic to the formal gauge transformation g. 
As we saw above, this can be done on sectors with z = 0 as a vertex and 
with a sufficiently small angle. But, as one enlarges this sector, the analytic 
gauge transformation asymptotic to a given formal one will cease to have 
this asymptotic property when the boundary of the expanding sector crosses 
certain lines, the so-called Stokes lines. This is the Stokes phenomenon. As 
the sector moves but with the analytic solution remaining asymptotic to the 
same formal solution, the fundamental solution gets multiplied by a constant 
invertible matrices. These constant matrices are the Stokes multipliers. The 
existence of the Stokes phenomenon is clear to see; if the asymptotic property 
remains the same when the sector expands, the gauge transformation will 


become meromorphic. 
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The Stokes phenomenon is a striking example of a discontinuous behavior 
in a situation where one does not expect such discontinuity. There are many 
other situations, in the theory of partial differential equations for instance, 
where such discontinuous asymptotic phenomena are encountered. See the 
paper of Friedrichs [13] for a very illuminating discussion of such discon- 
tinuous asymptotic phenomena, and [12a] for a treatment of many examples 
of these in the theory of ordinary differential equations. Unlike the situation 
with the regular singular equations where monodromy essentially determines 
everything, it is the Stokes phenomenon that proves decisive for irregular 
singular equations. 

To make the Stokes phenomenon more precise let us consider a meromor- 
phic connection matrix A and a canonical form B such that for a formal gauge 
transformation g we have g[A| = B. This equation can be rewritten in linear 
form as 


dg 
— = Bg—gA 
dz a 


so that the asymptotic theorem can be applied. By going over to a suitable 
covering plane we may assume that everything is over G = GL(n, F). We 
now introduce a covering of the unit circle by sectors [; on each of which 
there is an analytic gauge transformation g; asymptotic to the formal gauge 
transformation g on the sector I’;, and g;[A] = B onT;. If 


Rie = 2a, 
then gj, is asymptotic to 1 on’; NT, = T jx and 
gj[B] = B 


The gj, are thus gauge automorphisms of B that are asymptotic to 1 on the 
overlaps [; VT, = jx. It is easy to verify that the {gjx} verify a cocycle 
condition. The condition that there is a meromorphic h such that h{A] = B is 
that the cocycle be a coboundary. The cohomology class defined by (g jx) is 
thus an obstruction for the meromorphic equivalence of A and B. This class 
is an analytic invariant of the given connection A under the group Geor. 

This cohomological point of view was pioneered by Malgrange and Sibuya 
who proved [14a, 14b, 15b] the remarkable result that the Stokes class of a 
meromorphic connection is a complete invariant of the connection for the 
group Gea, Namely, that H'(St(B)) classifies the systems that are formally 
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equivalent to B with respect to the relation of analytic meromorphic equiva- 
lence. 


7.5 The Stokes sheaf and the scheme structure on H! (St) 


Let us begin with a very classical irregular singular equation, the Bessel 


equation. It is 
d? ldy y? 
z+oF +(1 )y=o. 


dt? tat t? 


where v € C is acomplex parameter. The equation is considered on P' and 
it has 0, oo as singular points, with 0 regular and oo irregular. We go to the 
associated first order system and to z = 1! 
singularity. The system is 


du 0 —! 
a Ay ’ A, = ou ° 
dz ‘ (, —v?z? z ) 


We can find g, € GL(2, C{[z]]) such that 


so that z = 0 is the irregular 


ee 
= 


ee ee, (1/20 
am: (02:) +2 (os iar 


The connection matrix B is a canonical form and does not depend on v. 
Thus the Bessel family is isoformal. The calculation for g, shows that 
g, € GL(2. C{v][[z]]). Thus the reduction to the canonical form depends 
holomorphically, even polynomially, on v. 

To see what the Stokes sheaf St = St(B) is, note that the equation g[B] = 
i 0 
0-i 
for Bo is obtained by a trivial integration, namely, 


et QO 
y od ( 0 ee) 


) : gvl[Ay] =B 


where 


B reduces to g[ By} = Bo where By = ee ( } The fundamental solution 
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so that g[ Bo] = Bo is equivalent to gy = wC where C is aconstant matrix. 
Thus the Stokes conditions become 


= _ [ab 
yCw'~1, Sallis 


daber 10 
cetilt OL 


For z in the upper arc of the unit circle this means that c = 0 while for the 
lower arc we must have b = 0. Thus the Stokes sheaf is a sheaf of unipotent 
groups; the stalk is the upper triangular group 7~ for z in the upper arc, and 
the lower triangular group T~ for z in the lower arc. At z = +1 the stalk is 
the identity. The Stokes lines are the rays through z = +1. We can take the 
covering consisting of the arcs U* where z #4 —1, and U" where z $ 1. It 
can be shown that this covering is good, i.e., the entire cohomology can be 
recovered from it. Hence 


Or 


H'(St) =T* «T- ~@. 


On 7* we use the nonvanishing off-diagonal entry as the coordinate. A 
calculation shows that the cohomology class of the Bessel connection A,, is 
given by the element of H'(St) with coordinates 


(cos 277 v, — cos 27 Vv). 


Sée [5b]; 

It is a remarkable fact that for any connection, the corresponding Stokes 
sheaf is a sheaf of unipotent subgroups of GL(n, C). By looking at specific 
coverings and using the fact that unipotent group varieties are affine spaces, 
one can show that the cohomology associated to good coverings is an affine 
space [14c, 14d]. But the problem of showing that this affine structure is 
independent of the covering involves technical complications since we are 
dealing with the cohomology of a sheaf of nonabelian groups. In [15b] this 
question is treated by a method originating from a suggestion of Deligne 
[5, 15a]. Roughly speaking, we think of the sheaf S of complex unipotent 
groups as a sheaf of untpotent group schemes over C. This means that for any 
C-algebra R we have a sheaf S(R) of groups and we can speak of H'(S(R)). 
We thus have a functor 
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Rt > H'(S(R)) 


and what one should show is that this functor is representable by affine space. 
This is done in [1 5b]. The existence of a scheme structure on H! (St) is quite 
remarkable. 


7.6 Reduction theory for families 


In classical analysis many of the natural special transcendentals like the 
Bessel, or Whittaker functions depend on parameters, and it is a remark- 
able fact that the formal equivalence classes of the members of these families 
do not change with the parameter; indeed we have seen this for the Bessel 
family above. So these families are isoformal, and it is a natural question 
to analyze the corresponding Stokes phenomena. An important question is 
whether the formal reduction depends analytically on the parameters. If this 
can be done, it would follow that the map into H'!(Sr) taking the connections 
of the family into their Stokes classes is also analytic, thus making H!(St) a 
moduli space {15b]. The consideration of such questions reveals a very un- 
expected and impressive landscape and I want to discuss some of the themes 
that arise. 

To treat such parametric families new methods are needed. The connection 
matrix A now depends holomorphically on a parameter A and one wants to find 
a gauge transformation also depending holomorphically on X that effects the 
reduction to acanonical form, presumably also depending holomorphically on 
i. The approach discussed earlier encounters problems because the Jordan 
form of the nilpotent matrix which is the leading coefficient of A is now 
typically discontinuous in d. This is impossible to overcome unless a different 
approach is used. The new approach is to consider reduction theory not over 
the complex numbers, but over the ring of holomorphic functions of 4, and to 
generalize the earlier method to this context. Readers who are familiar with 
algebraic geometry will recognize this as the relative approach to reduction 
theory. The issues are: what are the Jordan forms of nilpotent matrices 
over rings, whether the Jacobson—Morozov is true over rings, and whether 
one can prove the increase of dimension of nilpotent orbits under transversal 
deformations. The theory is somewhat subtle and is treated fully in [10c]. 
Here we sketch only some high points. For a bird’s eye view see [10b]. 
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The first step is to find the Jordan forms of a holomorphic nilpotent matrix 
under similarity by invertible holomorphic matrices, or more generally, over 
fairly arbitrary rings. This question has surprising ramifications. For instance, 
unlike what happens in the case of complex matrices, there may be infinitely 
many nilpotent orbits, and even moduli. 

Thus, over the ring ©, of holomorphic germs at A = 0, the matrices 


0 0 
(.. 5) (Onl 2 ee) 


belong to distinct orbits under GL(2,O,). Indeed, all the entries of the 
matrices of the form UY, V belong to the ideal generated by A” and so Y,, 
cannot be conjugate to Y,, under GL(2. O,,) ifn < m. Notice that, over C 
and for fixed m, all members of the above family for A # O have the Jordan 


form 

00 

10 
but at A = O the matrix is just 0, showing the discontinuity of the Jordan form. 
The Y,,, exhaust the Jordan forms. To see this, note that for a nilpotent L we 


can find a nonzero element wu of O; in the kernel of L, and we may assume 
that one of its two components is a unit. We can then find v such that {v. w} is 


; . wapltt 
a basis for O2. In this basis L has the matrix ( 
: 


0 
alk and as L- = 0 we must 


have a = 0. If we write c = gi" where g is a unit, conjugacy by ¢ “| 
&§ 


reduces L to Y,,. 
For 3 x 3 matrices with Jordan form over the quotient field as 


000 
100 
010 


the Jordan forms over ©, can be shown to be 


0 00 
| Oe =A" 0 0 
h "0 
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where h is a polynomial such that 
degree (h) < min(m,, m). 


Vet 
Min = {Limy.mg.n|degree (h) < min(my, m2)}. 


Then the orbit space for these nilpotents has two discrete parameters m,,m> 
which are integers > 0, and within each stratum defined by fixing the values 
of these two discrete parameters, it is a continuum, namely 


M as cinta 1/2) 
m),m2 — 


This situation remains true more or less broadly in general. Within a con- 
jugacy class over the quotient field of O,, the orbits of nilpotents under 
GL(n, O,) can be first stratified by discrete parameters which we shall call 
the discrete invariants, and once these are fixed, the orbit space becomes a 
constructible set. These remarks suggest that there is much interest in dis- 
cussing the structure of the adjoint orbits over function fields for a semisimple 
Lie algebra. 


Jacobson—Morozov 


The theorem of Jacobson—Morozov does not survive over rings. Indeed, if 
10 
there is atriple {H, X. Y = Y,,} over O,, then H must be of the form (: a ) 


but as H = XY — YX, its entries must all be divisible by 4". Moreover, for 
a given Y, even H may not exist: Ly, 5., has an associated H if and only if 
he: 

To overcome these difficulties we need to work with invariants for a nilpo- 
tent matrix which do not depend on Jacobson—Morozov. Let R be a Noethe- 
rian domain with quotient field K, and L a nilpotent matrix, viewed as an 
endomorphism of R”. Using a triple {H, X,Y = L} over the field K we 
define W, « as the span of the eigenspaces for H of eigenvalues < r. Then 
(W, «)rez is a filtration, finite and increasing for increasing ¢ with the prop- 


erties: 


Coma GW. 22K - 
(ii) If Wi = Wx«/W,-1.x, then for all j > 0 the map L/ induces an 


isomorphism L; : Wj.« ~ W_jx. 
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The filtration (W, x) is uniquely determined by L and is independent of the 
choice of H, X. It is the “Hodge filtration”; see [16]. We then define 


W, — WK al Ro 


But now L; is no longer an isomorphism and so we can introduce the R- 
modules = 

M,(L) = W_;/L;(W;) 
which are invariants of L under GL(n, R). 

From general principles of commutative algebra the most important case 
is when R is a discrete valuation ring. Let us now assume from now on that 
R is a discrete valuation ring (for example O,). Then the M;(L) are finite 
dimensional vector spaces over the residue field of R and we can define 


d(L) =dimM,(L) (Gj = 1). 


The d;(L) are the discrete invariants of L. For L = Y,, forn = 2 we 
have d; = m,d; = O(j > 1); form = 3,L = Emm, we have dz = 
m,+m 2, d; =0(j # 2). Fora given nilpotent there is an integer N > 1 such 
that d; = 0 for j > N and one can determine the d; recursively downwards, 
starting with dy. Let us define for any nilpotent L, the vector d(L) by 


d = d(L) = (dy (L), dy-1(L), ..., 4 (L)). 


We regard d(L) as an element of Z” the latter being given its natural /exi- 
cographic ordering. The vector d(L) is an invariant of the GL(n, R)-orbit 
Gh i; 


Transversal nilpotent deformations 


It is now natural to ask what happens when we consider an analytic family 
L(e) of nilpotents with L(0) = L. For this I need to explain what an admis- 
sible nilpotent is and what is meant by a strictly transversal deformation of a 
nilpotent. We shall assume actually that R is a C-algebra and that the residue 
field of R is C, asin R = O,. 

Since R is a discrete valuation ring the torsion free nature of the modules 
W,/W, (s < t) implies their freeness and thus the existence of submodules 
W" such that W, = W,_, @ W™: the element H ¢ End(R") defined by 
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requiring that Hu = tu for u € W" is said to be associated to L. It is not 
unique since the W are not unique in general. We say that L is admissible 
if there is an H associated to L such that [H, L)| = —2L. 

A given L need not be admissible; however, if we expand L in terms of the 
eigenspaces in End(R") for ad(H), itis clear that L = L_>+ L_3+... where 
L, isin End(R") with (H. L,| = rL,, from which it follows that L’ := L_> is 
an admissible nilpotent such that H is associated to L’, and what is decisive, 
L' has the same filtration as L and the same torsion modules M;. From the 
above remarks it follows that 


d(L) = d(L’). 


Thus admissibility is not such a strong demand and one can work around 
nonadmissibility in reduction theory. For instance, for L = Ly,, ..5.,, admis- 
sibility happens if and only if h = 0 and we can take L’ = Ly, 1.5.0. 

Strict transversality is a sharpening of transversality over C. Let S be an 
ad(H)-invariant C-linear subspace of End(R”) which is complementary to 
the range of ad(L); such an S exists and let S*be the subspace of S spanned 
by the eigenvectors of ad(#/) for the eigenvalues > —1. 


Definition. The family L(e) of nilpotents (L(0) = L) in End(R”) is strictly 
transversal if L(e) € L + S* forall e. 


The basic theorem of deformations of nilpotent matrices 1s now the following. 


Semicontinuity theorem. /f L is admissible and is moved strictly transver- 
sally within the class of nilpotents over R to a nilpotent L' ¥ L, then either 
the orbital dimension of L' over the field K is strictly bigger than that of L, 
or the nilpotents stay in the same orbit over K but the vector A(L’) ts strictly 
less than the vector d(L) in the lexicographic ordering of Z “We denote this 


situation by 
Lee L. 


In analogy with what happens in the case of C it would be natural to say 
that the family is transversal it L(e) € L +S for alle. If R = C it is 
automatic that S = S+ but this not true always for general R, so that what 
we have is a sharpening of the transversality condition. As an example of a 
deformation which is transversal but not strictly, consider L(€) = Ly) nos 
with m,,m> > 1. Then L(e) = L(O) + €Z where 
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000 20 0 
Zi= | 0100 Hes) 00 L016 [, AS 4zZ. 
100 00-2 


As a consequence of the semicontinuity theorem we have the almost exact 
analogue of what is true over C. 


Theorem. Let notations and assumptions be as above, If Z #0 € S* and 
L’ = Lo + Z is nilpotent, then L’ <~< Lo. 


Reduction theory over general rings 


We introduce a general ring R, which for simplicity we assume to be a Noethe- 
rian integrally closed domain with quotient field F. We have a connection 
matrix 

Aa A Ae oe (r < —2) 


where the A ; aren xn matrices over R. The aimis to try to get canonical forms 
for such matrices under the action of gauge transformations which are Laurent 


'/> with coefficients in R. Inspired by the above discussion for 


series in Z 
canonical forms for nilpotent matrices we assume that over K the connection 
has a canonical form. However the canonical form will be defined over the 
algebraic closure of K and so will bear little resemblance to a connection 
over R. We therefore assume that the canonical form is really defined over R. 
Moreover it is quite possible that the different elements of the joint spectrum of 
een @ Bane Os egeeae D,,) may have contacts which is an example of the turning 
point phenomenon. We shall say that the connection is well-behaved if this 


is not so. 


Definition. Let R be a Noetherian integrally closed domain of characteristic 
0, and K be, field of its quotients. Let A be a connection matrix whose 
Laurent series has coefficients which are matrices over R. We say that A is 
well behaved if it is equivalent to a canonical form (which is also called well 
behaved) 

B= Dz" +--++Dyz™ +27'C 


under the group of gauge transformations which are Laurent series in z!/” 
over the algebraic closure of the field K where 


(1) the D; are diagonal matrices with entries in R 
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(2)if Qi, .... Am) and (A), ..., A’) are distinct elements in the joint spec- 
trum of(D,,..., D,,), andifk is the first of the integers i for whichA; # Ve 
then A, — A) is a unit of R. 


The concept of well behaved connection matrices is somewhat strin- 
gent. There are examples of connections that are not well behaved and 
which cannot be reduced to a canonical form under GL(n, ikea) where 
Roce = Oper Rie’ lz "1; see 0c), pp. 85-87. Still it is a useful con- 
cept, especially for isoformal families, namely families where the canonical 
form over the complex numbers does not change. This is the situation with 
most classical families—Bessel, Kummer, Whittaker, etc. In fact, we have 
the following useful result. 


Proposition. Let R be the ring of analytic functions on some fixed polydisk 
in C4, and let A € gl(n, R{[z]][z7']) be such that for all X in that polydisk 
the complex connection matrices A(A) are unramified and have the same 
canonical form. Then A ts well behaved. 


For well behaved connections on suitably restricted rings one can establish 
a reduction to canonical form. I give here one of the many possible results. 


Theorem. Let R be a Henselian discrete valuation ring over C with residue 
field C, for instance O,. Let A € gS(n, O,{[z]][z~']) be a well behaved 
connection. Then A can be reduced to a canonical form. 


This result is essentially the one parameter case of parametric reduction 
theory. For rings of analytic functions and function germs it is possible to 
extend Theorem 1 rather completely. 

For analytic families of connections whose canonical forms do not depend 
on the parameters (isoformal families, we have the following result, stated in 
a sharper form than the preceding theorems, in terms of reduction over R[[z]] 
itself. 


Theorem. Let A be a polydisk in C“ centered at the origin and let R(A) be 
the ring of holomorphic functions on A. Let 


A €gl(n, R(A)[[z]][z7') 


be a connection such that A(A) is equivalent to A(O) under GL(n, C) for 
alli € A. Then there exists a concentric polydisk A, C A andan x € 
GL(n, R(A,)[[z}][z7')) such that x{A] = A(0). 


204 
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The proof of the reduction theorem in the henselian case follows closely 
the proof in the absolute case and uses the semi-continuity theorem and its 
corollary. The extension to polydiscs needs additional arguments, see [10b, 


10c]. 


The extension of these results to connections that may exhibit tuning 


points is a major unsolved question. For turning points, see [17]. 
Of course this survey just touches on those aspects that I have been able 
to say something about. For other aspects, see the letters in [5]. 


References 

[1] B. Riemann, Collected Papers, Springer-Verlag, 1990, 99-119. 

[2] J. Gray, Linear Differential Equations and Group Theory from Riemann to 
Poincaré, Birkhauser, Boston, 1986. 

[3] P. Deligne, Equations Différentielles a points singuliers régulier, Springer Lecture 
Notes in Mathematics, Vol. 163, 1970. 

[4a] M. Yoshida, Fuchsian Differential Equations, Vieweg, 1987. 

[4b] V.S. Varadarajan, Some remarks on meromorphic differential equations with sim- 
ple singularities, Calcutta Math. Soc. Diamond Jubilee Volume (1983), 49-61. 
Selected Papers, 427-440, AMS 1999. 

[5] P. Deligne, B. Malgrange, and J.-P. Ramis, Singularités Irréguliers, Correspon- 
dances et Documents, Soc. Math. de France, 2007. 

[6] M. Hukuhara, Selected Papers, 1997. 

[7] N. Katz, Nilpotent connections and the monodromy theorem: application of a 
result of Turrittin, Publ. Math. IHES. 39(1970), 207-238. 

[8] A.H.M. Levelt, Jordan decomposition for a class of singular differential opera- 
tors, Ark. Matematik, 13(1975), 1-27. 

[9] HH. Turnitin, Convergent solutions of ordinary differential equations in the neigh- 
borhood of an irregular singular point, Acta Math., 93(1955), 27-66. 

[10a] D.G. Babbitt, and V. S. Varadarajan, Formal reduction of meromorphic differential 
equations: a group theoretic view, Pacific J. of Math., 108(1983), 1-80. 

[10b] V.S. Varadarajan, Linear meromorphic differential equations: a modern point of 
view, Bull. AMS., 33(1996), 1-42. 

[10c] D. G. Babbitt, and V. S. Varadarajan, Deformations of nilpotent matrices over 
rings and reduction of analytic families of meromorphic differential equations, 
Mem. AMS., vol. 55, No. 325, 1985, 1-147. 

[11] E. Fabry, Sur les intégrales des équations différentielles lineaires a coefficients 
rationels, These, Paris, 1885. 

[12a] W. Wasow, Asymptotic Expansions for Ordinary Differential Equations. Dover. 
1987. 

[12b] D. G. Babbitt, and V. S. Varadarajan, Some remarks on the asymptotic existence 
theorem for meromorphic differential equations, J. Fac. Sci. Uni. Tokyo, Ser. IA.. 
36 (1989), 247-262. 

[13] K. O. Friedrichs, Asymptotic phenomena in mathematical physics, Bull. AMS.., 
61 (1955), 485-504. 

[14a] Y. Sibuya, Stokes phenomena, Bull. AMS., 83 (1977), 1075-1077. 

[14b] B. Malgrange, Remarques sur les équations différentielles a points singulier ir- 
réguliers, in Springer Lecture Notes in Mathematics, Vol. 712. 1979. 

[14c] W. Balser, W. Jurkat. and D. A. Lutz, Birkhoff invariants and Stokes multipliers for 


meromorphic linear equations, J. Math. Anal. Appl. 71 (1979), 48-94; A general 


References 205 


theory of invariants for meromorphic differential equations, Funkcialaj Ekvacio}, 
22(1979), 257-283. 

[14d] W. Balser, Zum Einzigkeitssatz in der invariantentheorie meromorpher Differen- 
tialgleichungen, Jour. Reine und Angewandte Mathematik, 318(1980), 51-82. 

{15a] Deligne letter to VSV in [5]. 

[1Sb] D.G. Babbitt, and V. S. Varadarajan, Local Moduli for Meromorphic Differential 
Equations, Astérisque, 169-170, 1989 

[16] P. Deligne, La conjecture de Weil, Publ. Math. IHES, 52 (1980), 137-252. 

[17] W. Wasow, Linear Turning Point Theory, Applied Mathematical Sciences, 54, 
Springer-Verlag, 1985. 


Chapter 8 


Mackey, Harish-Chandra, and representation 
theory* 


May the good things come to us from everywhere. 
Rig Veda 


Some personal reminiscences of Mackey and Harish-Chandra together with brief com- 


ments on their work. 


8.1. George Mackey and his view of representation theory 
8.2. Harish-Chandra as I knew him 

8.3. Some reflections 

8.4. Fourier analysis on semisimple groups 


8.1 George Mackey and his view of representation theory 


I first met George Mackey in the summer of 1961. He came to the University 
of Washington, Seattle, as a distinguished lecturer and gave twenty lectures, 
lecturing every day for four weeks. The first half of the series dealt with 
representation theory and the second half with quantum theory. He was at 
the peak of his powers and achievements and the lectures were a revelation 
to me. His imagination, his ability to understand the core issues, and his free 
and easy charm and accessibility just blew me away. Ina very essential sense 
I became his student, and the impact of that one month has stayed with me 
for ever. On the occasion of a posthumous retrospective on his work held by 
the American Mathematical Society I wrote [la]: 


. It was a great opportunity for me to see a great master at work, and that 
encounter shaped my entire mathematical career. | became a great admirer of his 


*“ Over the years | have spoken on many occasions on the work of George Mackey and 
Harish-Chandra on representation theory, and what it has meant to me. This essay draws 
upon my articles as well as of my memories. 
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way of thinking that encompassed a broad picture of mathematics and physics, 
and emphasized concepts above brute calculation and ideas above technique. I 
was deeply impressed by the curiosity as well as humility with which he viewed 
the role of mathematics and the mathematician in the understanding, description, 
and interpretation of the world of phenomena around us. 


I had many opportunities for long chats with him during this visit. Our 
topics of discussion were quantum theory and representation theory, the twin 
themes of his lectures. In quantum theory I was fascinated by his axiomatic 
approach which took off from von Neumann’s theory of the quantum logic 
as a projective geometry. When I returned to India in 1962 I gave a series 
of lectures at the Indian Statistical Institute which were then turned into the 
two Van Nostrand volumes, eventually republished in a second edition, by 
Springer, as a single volume. The Van Nostrand version was also a single 
volume; but, Professor Marshall Stone, who was the editor of the series, 
thought that it was better to split them into two volumes. I must mention that 
throughout this encounter Professor Stone behaved with perfect courtesy and 
consideration, making this experience thoroughly gratifying for me. 

As for representation theory, I had read parts of Weyl’s Classical Groups 
and so was interested in representations of semisimple Lie groups. Mackey’s 
advice was simple: read Harish-Chandra’s papers. He said that Harish- 
Chandra had a “terrifying technique” of Lie algebras and it was essential, for 
anyone interested in the theory of semisimple groups and their representations 
to understand this technique. It was an advice that turned around my entire 
research career and launched me on my long love affair with semisimple 
groups. 

When Calvin Moore organized a celebration of Mackey’s 70th birthday he 
asked me to say a few words. He mentioned that Mackey had always regarded 
me as a “half student” because of the encounters in Seattle. I was certainly 
proud of this. Years later when I became interested in the representation theory 
of super Lie groups, it was his theory of representations of semi direct products 
that I extended (with my collaborators in Genova) to the supersymmetric 
context [2]. This work has now been continued by Hadi Salmasian to super 
nilpotent groups [3]. 

The articles and reminiscences in [1a, 1b] paint a very vivid and moving 
picture of Mackey, his mathematics, his almost childlike simplicity, and his 
profound vision. There is therefore nothing I can add to this portrait except 
to say that it was a very uplifting experience for me to have such a great 
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mathematician take interest in a young and unknown person like me and 
really make genuine suggestions. 

He was unique in many ways. As far as I know he did not work under 
contracts and did not participate in political activities that are common in 
the world of science. He had a vision of mathematics and physics that was 
unique and he stuck to it, long after his work became less fashionable. I have 
always admired him for this attitude. He came up in an era of giants but 
was able to hold his own quite well. In his understanding of quantum theory, 
especially from the group theoretic point of view, he was the true successor 
to von Neumann and Wey|. 

Mackey’s program, although he never explicitly described it as such, may 
be described as follows: to determine the unitary irreducible representations 
(UIR) of all locally compact second countable ((Icsc) groups, and to decom- 
pose important representations into irreducible constituents. The origins of 
the program certainly go back to the result proved in 1943 by Gel’fand and 
Raikov to the effect that the UIRs of any Icsc group separate its points. This 
suggested that it would be worth looking at the unitary representation theory 
for such groups. For compact and abelian groups a substantial theory was 
already developed in the years 1920-1940. 

Many people started the study of representation theory and harmonic anal- 
ysis for the Icsc groups. But this class of groups seemed too wide for detailed 
work and only some general results could be obtained. Mackey wisely con- 
centrated his attention on certain specific aspects. His work on the extension 
of the Stone-von Neumann theorem to general locally compact abelian groups 
led him naturally to the concept of a system of imprimitivity and the closely 
related theory of induced representations. These were originally studied for 
finite groups by Frobenius who was one of the founders of the representa- 
tion theory of finite groups. Mackey saw that the notions of Frobenius made 
sense for the general locally compact groups and developed the appropriate 
generalizations. This proved to be a very fertile ground and led him to the 
representations of semi direct products, and the classification of their repre- 
sentations in terms of orbits and representations of the stabilizers of points in 
the orbits. If G = A x’ A is asemi direct product with A abelian, then the 
UIRs of G are classified by the orbits O of H on A, and for a given O anda 
point x € O, by the UIRs of the stabilizer Hy of x in H. This classification 
has a caveat, namely, that the action of H on A be regular. This was a new 
idea, regularity being a measure theoretic restriction: an action of a Icsc group 
R onalesc space X is regular if the orbit space X/R is countably separated, 
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or what is the same thing, if there are countably many R-invariant Borel func- 
tions on X separating the orbits of R. In most natural applications (such as 
the Poincaré group so important in relativistic physics) one encounters only 
regular semi direct products. It was left to Effros, a student of Mackey, to 
prove the beautiful result that regularity is equivalent to the condition that all 
orbits be locally closed, i.e., open in their closure. Nonregular actions are 
generally called ergodic, for instance the action of Z on the circle group T 
by an irrational rotation. If the semi direct product is ergodic, then there are 
many more UIRs of G than the ones coming from the orbits and stabilizers. 
The conventional wisdom is that in this case one cannot classify the UIRs, 
but it may not be too much to hope that partial classifications may be sought 
for. 

This work of Mackey brought out surprising aspects of representation the- 
ory of Icsc groups and their relationship to some very sophisticated aspects 
of measure theory, such as the standard Borel structures. One example of this 
was the following. Let G be a Icsc group and G be the set of equivalence 
classes of UIRs of G. Since G is Icsc, all its UIRs are in separable Hilbert 
spaces, and one can take concrete models of these Hilbert spaces (one for 
each dimension n = 00. 1,2,..., say €*. C, C*....) and think of the set of 
all UIRs, say X, in a concrete manner, with a relation ~ of unitary equiva- 
lence. Clearly we can identify X/ ~ with G. X is astandard Borel space but 
G need not be as the relation = may be ergodic. The group G may be said 
to have a smooth dual if G is standard. The Mackey philosophy was that a 
classification of UIRs of a Icsc group 1s possible if and only if the group has 
a smooth dual in this sense. He conjectured that a group has a smooth dual 
if and only if all of its factor representations are direct sums of irreducibles. 
Here a factor representation 1s one whose commutant is a factor in the sense 
of Murray and von Neumann, namely, its center consists of only the scalars. 
Such a group is said to be of type 7. Glimm proved Mackey’s conjecture; later 
other proofs were obtained by Effros. Whether a group is type I or not is a 
deep question; Harish-Chandra proved that real semi simple Lie groups are 
type | and Dixmier extended this to include all real algebraic groups. Bern- 
stein proved that reductive groups over local fields are type I. These results 
however use ideas and techniques far removed from the Mackey universe. 
The work of Glimm and Effros was very fundamental in the clarification of 
these ideas. Mackey also showed that for a type I group a very satisfactory 
theory of decomposition of unitary representations can be built by expressing 
any representation as a canonical direct integral over the dual. For multiplic- 
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ity free representations this gives a bijection with measure classes on the dual 
of the group. Perhaps this work of Mackey will be the foundation for a theory 
of unitary representations of groups defined over locally compact rings that 
will unify the theories currently pursued over local fields and adele rings. 

His success with the theory of semi direct products led Mackey to the fol- 
lowing very general question: if G is a Icsc group and N a closed normal 
subgroup, how to build the representation theory of G in terms of those of N 
and H = G/N? It is natural to assume that N is of type I and decompose a 
UIR of G by restricting it to N via projection valued measures on N. There is 
a natural action of H on N and one can expect a good theory if this action is 
regular. However Mackey found that the analogue of the semi direct product 
theory can be carried out but one will have to know the theory of projective 
unitary representations of H. This appeared very discouraging at first but 
Mackey showed that the descent from G to N and H could be accomplished 
if one works with projective representations from the very beginning. For 
instance, for low dimensional nilpotent groups G one can get a comlete de- 
scription of G. But this approach was superseded when Kimllov discovered a 
very simple and direct way to construct UIRs of nilpotent Lie groups via orbits 
in the Lie algebra, thus beginning the rather profound connection between 
orbits and representations. Kirillov’s theory was extended in a deep manner 
to solvable groups by Moore, Pukanszky, Auslander, and Kostant. However 
it must be pointed out that Mackey did not restrict himself to Lie groups, and 
his theory subsumed projective representations. A general theory of nilpotent 
or solvable groups and their duals which might contain the Lie group theory 
as a special case may still be possible. 

His work on projective representations led him to a beautiful theory of 
central extensions of Icsc groups. He discovered that one has to go beyond 
continuous cohomology and work in the Borel category. His student Calvin 
Moore pushed Mackey’s theory in many remarkable directions, calculating 
the central extensions of p-adic and adelic linear groups, thereby revealing 
their connections with classfield theory and arithmetic [4]. 

Finally, Mackey was the first person to discover that ergodic actions were 
worth investigating in a major way. He did not push is ideas too strongly, 
but another of his students, Robert Zimmer, went very far in linking the type 
of measure theory and group actions that was typical in the Mackey world 
to questions of great depth and impact in the theory of discrete subgroups 
of semisimple groups and the work of Margulis [5]. To me, the ideas that 
link measure theory, Borel cohomology, ergodicity, and so on, to semisimple 
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groups are among the most surprising that have come out of the work of 
Moore, Zimmer, and others. 

Mackey’s ideas also had substantial impact on semisimple groups. His 
double coset formula for the intertwining numbers of induced representations 
of finite groups was the inspiration behind Bruhat’s work on the irreducibility 
of the minimal principal series for semisimple groups. Furthermore, Roger 
Howe who was a student of Calvin Moore, applied Mackey’s ideas in combi- 
nation with his own in a decisive way to develop a beautiful theory of small 
representations of metaplectic groups [6]. Howe’s work was continued by 
his students Jian-Shu Li and Hadi Salmesian to include all classical groups 
and most, if not all, semisimple groups over local fields of characteristic 0 
[7a, 7b]. 

As I think over Mackey’s life and his achievements I have come to realize 
that he was in many ways an extremely atypical figure in the American aca- 
demic milieu: very introspective, not very concerned with self-promotion, 
content to follow his own vision even when it became unfashionable, and 
working almost to the end of his life trying to communicate his views on 
a large part of mathematics to the next generation. He had a large number 
of students, many of them elite mathematicians, who grew up in his world, 
made beautiful discoveries of their own, and continued his vision through 
their students. It was my greatest good fortune that I was able to interact with 
him at a crucial stage in my career. 


8.2 Harish-Chandra as I knew him 


I came to the mathematics department of Princeton University in January 
1960 as a postdoctoral fellow to spend the rest of the academic year there. 
Sometime during that stay Professor Herbert Robbins of Columbia University 
invited me to give a talk in his statistics seminar. Robbins was very kind and 
wanted to know if there was anyone I wanted to meet specially. I suggested 
Harish-Chandra and he took me to meet him after my talk. 

That meeting was very brief as Harish was leaving for a dinner with the 
Colloquium speaker. He told me that to work in semisimple groups one must 
know the three volumes of Chevalley backwards (which is an Indian idiom 
meaning total mastery), and also that I should know “roots.” He must have 
understood at once that I was a novice and refrained from mentioning his 
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own work. I had already begun looking through the Chevalley volumes and 
knew that they treated Lie groups not only as transcendental objects but also 
as objects in the algebraic category. But I did not know what roots meant, nor 
was | aware of the Séminaire Sophus Lie and Séminaire Chevalley. | would 
get to know about them in a couple of years. 

Even before I left for the US we had a small group consisting of myself, 
Ranga Rao, K. R. Parthasarathy, and Varadhan, at the Indian Statistical Insti- 
tute. Actually Varadhan came almost when I was about to leave for the US. 
Parthasarathy, Ranga Rao, and Varadhan succeeded in developing the com- 
plete theory of infinitely divisible probability measures and the associated 
limit theorems for arbitrary locally compact abelian groups, generalizing the 
classic work of Kolmogorov and Gnedenko to this context. Varadhan had 
then extended this theory in a remarkable manner to Hilbert spaces. When I 
returned to the Indian Statistical Institute in the Fall of 1962, the only mem- 
ber of that group still there was Varadhan. I have discussed elsewhere [8] 
about that remarkable year of working with him. We made the almost insane, 
certainly very courageous, decision to see if we could understand Harish- 
Chandra’s work on infinite dimensional representations of semisimple Lie 
algebras, when the underlying group was complex. For infinite dimensional 
representations coming from the group G it was very clear from his work that 
the basic object was the pair (g, &€) where g = Lie(G) and @ = Lie(K), K 
being a maximal compact subgroup of G. When G is complex, the imbed- 
ding of € in g can be specified algebraically (this is already in his papers, even 
explicitly), so that the infinitesimal theory becomes entirely algebraic. After 
complexification, (g, &) becomes isomorphic to (gc x gc. gc), the second 
factor being diagonally imbedded in the first. 

Varadhan left for the Courant Institute in 1963 but by that time Ranga Rao 
and Parthasarathy had come back and we spent the next two years pursuing 
the project begun with Varadhan. This period, 1963-65, was one of the most 
wonderful in my career. Calcutta was a very difficult place to live, with 
recurring power outages, rationing of essentials like rice and sugar, and other 
disruptions of daily life. The war in Vietnam was heating up and Calcutta, one 
of the most politicized centers in India, was in an uproar over this. The only 
constant was our work. We would work nonstop every day. We would read 
statements by Robert McNamara that it was only a matter of time before the 
war would end since half of Hanoi’s electrical capacity was destroyed. We 
would laugh at this since we were always working in Calcutta much below 
capacity and yet the city and the people carried on heroically. I realized at 
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that time that the US had no understanding of Asia and would always fail in 
political ventures there, as later and even contemporary events would prove. 

Ranga Rao and Parthasarathy kept our work going with their imagination, 
power, and dedication, and we obtained many new results. We wrote a letter 
to Harish outlining what we had done but he had already left on a tour of 
India under the auspices of the University Grants Commission. To illustrate 
how little the monumental stature of Harish-Chandra was appreciated in India 
at that time, his visit made absolutely no waves, while the visit of a certain 
cosmologist, who was staying as a special guest of Lal Bahadur Shastry, the 
Prime Minister, was played up in the newspapers. To be fair, things are not 
very different here in the US even today, as the headlines and coverage in 
even great newspapers like the New York Times have very little correlation 
to the scientific importance of the topics discussed. 

Harish came to Calcutta in January of 1965 and we arranged a visit to 
our institute. This visit was memorable for us and I have written about it 
elsewhere [9]: 


... My impressions of that encounter are still vivid in my mind. He was, to use 
a sports metaphor, at the top of his game, and was very impatient with the usual 
delays and other inconveniences typical in an Indian visit. The visit to Calcutta 
was a part of an extended itinerary arranged (I think) by the University Grants 
Commission of India. While waiting for the elevator to go to our offices I told 
him that our work in the preceding two years had led us to a possibly novel way 
of constructing (infinite dimensional) representations of complex semisimple Lie 
algebras and hoped that he would listen to what we had done and give his comments. 
He immediately responded by saying that the infinitesimal approach did not go far 
enough. I remember Parthasarathy turning to me and mutter in Tamil that I had 
better handle this tiger! | must say that I was also disturbed at his abrupt manner 
and so asked him whether, in view of this, we should even bother to proceed to a 
discussion! He immediately became very gentle and said that he would very much 
like to know what we had done. Our meeting was wonderful for us and gave a real 
glimpse into his view of the whole subject. Later on, during a lunch with Professor 
C. R. Rao and others, I asked him why the exceptional series was important—a 
question that was to be with him nearly all his life—and he answered that “‘Selberg 
has emphasized to me its importance and Selberg is always right.”” Neither I nor 
my friends had ever before met a person of such intensity and we were completely 
overwhelmed by his passion and power. He expressed his views on mathematics as 
well as everything else with a sharpness and focus that was, at least in our limited 
experience, unprecedented. 


The discussions with him in Calcutta were unforgettable. We were de- 
lighted that we had been able to do something interesting. The encounter 
generated a deep resolve within me to try to understand the method by which 
he reached the deepest results in the theory, namely, via the differential equa- 
tions. It occurs to me that in some sense we were very lucky to stumble onto 
this area and this complex of questions just ten years after his pioneering 
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efforts started in the early 1950s. It made possible that we could ask reason- 
able questions and discover new facts before the literature piled up and made 
progress very difficult. 

The next time I met him was in 1968 when I visited the Institute as an Alfred 
P. Sloan fellow. By that time I had an understanding of the construction of 
the discrete series and his lectures at the Institute were about the continuous 
Spectrum and the Plancherel formula. His lectures emphasized what he called 
the philosophy of cusp forms and his method of relating the asymptotics of the 
matrix coefficients of the various series of representations with the Plancherel 
formula. But the lectures had to be cut short because he suffered during 
the second term the first of his many heart attacks. He had been working 
incredibly hard without breaks for many years, and this pattern of life had 
caught up with him ina very unpleasant manner. Unfortunately the treatments 
were relatively primitive even in the US at that time but he did make some 
changes in his lifestyle—more vacations for a start and no work during the 
vacations. 

I had many opportunities to talk at leisure with him on his work and came 
to know him quite well. It was during this period that I once asked him if he 
would suggest some questions that I could look at. He did not want to do it 
mainly because he felt that I would be comfortable only with those problems 
that I myself had thought about. This advice to work on what mattered most 
to me was something I never forgot. From that time on I always tried to work 
on things that were important to me, no matter how humble they might have 
seemed to others. The idea of working with super-Schwartz spaces came 
to me at that time, although I did not do anything till Peter Trombi came 
to Los Angeles to lend a helping hand. The collaboration with Peter is one 
of the happiest in my life. We worked out the L'-version of the theory of 
spherical functions [10] and discovered the fact that the matrix elements of 
the discrete series decayed faster and faster as their parameter became larger 
and larger [11]. 

If Iam not mistaken, the monumental work of Jacquet and Langlands had 
just appeared. It was during that visit of mine that he spoke admiringly of 
this work, commenting on how far-reaching it was as they could obtain the 
Artin nonabelian L-functions by their methods. He thought, and that was the 
only advice he ever gave me, that perhaps I should study elliptic curves. 

I remember once a conversation in which he was telling me how difficult 
it was for him to learn new things. He said that every time he wanted to go 
a little deeper into algebraic geometry he was told that he should learn about 
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schemes. He said that this is like a patient who has an iron deficiency and is 
asked to swallow a pound of nails every day. It was clear that all his time was 
spent on his vision of harmonic analysis that required almost inhuman focus 
and concentration, excluding everything else. 

I did not meet with him very often in the years after 1970. One such occa- 
sion was at the Williamstown conference in 1972. He had already extended 
his philosophy of cusp forms, outlined in his Oregon Colloquium lectures, to 
the p-adic case, and obtained the connection between asymptotics of matrix 
elements and the Plancherel formula. He did this, both in the real and p-adic 
case through his wonderful theory of Eisenstein integrals and what he always 
called the Maass-Selberg relations. Initially he worked with the supercusp 
forms but later on was able to include all the discrete series. During the next 
years he had additional heart attacks and his general well-being was deterio- 
rating. He often spoke to me of his internal conflict, imposed by his health, 
whether to look for new discoveries or polish what he had already done for 
publication. He chose the former as attested by the masses of unpublished 
papers he left behind, most of them written only for himself. 

In 1982 I received a call from him asking me if I could go to Toronto to 
deliver his invited talk as he himself was forbidden to travel. I agreed and 
went to Princeton, spending a week discussing what I should emphasize in 
my presentation. He had planned to talk on the spectral theory of Whittaker 
functions. He had done the work in the 1970s, but when he started to organize 
his work for the talk, he discovered to his horror that the main technical result 
on which everything hinged on was incomplete: its proof contained a serious 
flaw. He told me that it was as if “the main beam in his house had fallen.” 
He worked night and day to repair it. He did succeed but the effort triggered 
another heart attack and he was advised against going to Toronto [12]. It was 
during this visit that I began to realize that unless he took total rest things 
would go bad. He himself was aware of this and told me that he had pushed the 
subject as far as he could, and that it was up to others to continue. The lecture 
I gave at the AMS meeting was not a success-how could it be otherwise? | 
spoke to him after the talk by phone and he consoled me saying that things 
would have been the same if he himself had given the talk. He felt that he 
had long ago ceased to have any meaningful interaction with the others in the 
field, and as a consequence, become too isolated. 

In the summer of 1983 I received a call from Lily that they would both 
very much like me and Veda to spend a couple of weeks in Princeton. We 
went and it was a wonderful time for all of us. I went for long daily walks 
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with him and we often had dinner together at their house. He had become 
very introspective and it was clear that his thoughts were far away. Veda 
gave him a small cross which she had picked up in the Vatican museum and 
he was very moved by it. I was distinctly uneasy when we came back to 
Los Angeles. Nevertheless I was unprepared for Helgason’s phone call on 
October 16, 1983 that he was no more. 


8.3 Some reflections 


Counting from the time I first met him, it is almost fifty years now. Memories 
are slowly fading, the colors no longer as vivid as when things first happened. 
I feel that I should set some of my recollections down before the pitiless flow 
of time erases everything. 

His lectures were very inspiring, but only to those who understood his 
philosophy. He tried hard to explain but he was so far ahead that he was able 
to reach out only to a small number of people in the audience. The situation 
was essentially the same when he gave the AMS Colloquium lectures in 
Oregon in 1970. Bott was the other lecturer in that meeting and Harish was 
very cognizant of the difference in their lecturing styles. Bott had a natural 
way of explaining things very simply and was a great raconteur. Harish, 
with his ascetic temperament, and fresh from a second heart attack, had no 
sense of the trivial, and his lectures overwhelmed the general audience. The 
lectures were published [13] in the Bulletin of the AMS and contained a bare 
outline of amonumental work parts of which are still buried in his unpublished 
manuscripts. 

The last part of his lectures and the Bulletin article deal with the p-adic 
groups and represent his first attempt to extend his theory into this domain. 
However this is somewhat misleading. Already in 1965, when he met us 
in Calcutta, he told us that one should not stop with the real groups but go 
to the p-adic groups as well, and ultimately, to the adelic groups and try to 
understand and extend the work of Hecke and Siegel in the context of general 
semisimple groups. He was mathematically close to Selberg and Langlands 
at that time. When Langlands came up with his epoch-making ideas that are 
now broadly described as the Langlands program, he was one of the first to 
understand and emphasize their scope and importance. But he himself was 
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more conservative, preferring to rely on his own intuition to probe the world 
where arithmetic and Fourier analysis interacted at a very deep level. 

His asceticism and single-mindedness made it very difficult for most people 
to get close to him. In a world of high flying entrepreneurs of mathematics 
who behaved like prima donnas and jet-setting conductors, he was the ultimate 
loner who trusted only his own results and insights. In his mature years there 
was only one collaboration, the one with Borel on arithmetic subgroups of 
algebraic groups which led to their famous paper in that subject. This aspect 
of his professional life became more pronounced in his later years, and had 
the effect of isolating him from his contemporaries. Like Siegel before him, 
he saw his work fade into the background as a huge monolith, waiting for the 
future generation to understand its beauty and power and ability to lead to 
further discoveries. One cannot say however that this was unexpected. The 
works of all the great figures of the past have suffered this fate, and he was 
no exception. Unlike many he would never trumpet his ideas and never insist 
that the world take notice of it. 

As the years passed by, the younger generation did not go back to his ideas. 
Perhaps his methods were too difficult for anyone else to use. Although this 
might have been frustrating to him, his philosophical approach to life and 
work gave him serenity and security. I remember getting very depressed 
during the Corvallis conference on automorphic forms in 1977 when no one 
even mentioned his name. This must have upset some others also because 
I still remember a lecture in which Langlands chided the attitudes of the 
younger people by pointing out that they were too young to know that there 
was a time when there was no discrete series. 

It is a fact that he made little effort in the introductions to his papers to take 
the reader along, and that the main results came only after gigantic collections 
of lemmas. However, in my opinion, the sequence of eight papers [14] leading 
to the construction of the discrete series form a self-contained whole of great 
beauty. He must have felt this deeply because of what he said on the occasion 
of a ceremony at the Indian embassy in Washinton D. C., honoring Indian 
savants: 

vaeenlie study of continuous groups was started by Sophus Lie. Among these 

the ones with finite volume are called compact. Around 1925 Hermann Weyl 

extended the work of Frobenius and Schur to compact groups and discovered a 

beautiful formula which is now called the Weyl character formula...] joined the 

field around 1950 and my early papers were concerned with algebraic methods for 
dealing with infinite dimensional representations. About the same time Laurent 


Schwartz in France had invented the notion of distributions. It turned out that, in 
order to handle certain differential equations on the group, his distributions were 
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just the right concept. With their help it eventually became possible to prove an 
analogue of Weyl’s character formula for all semisimple groups and extend to them 
the theory of Fourier transforms. 


In spite of all the talk about the orbit picture for representations, it was in 
these papers that the correspondence between orbits and representations was 
fully established, in the most important case of semi simple groups. The 
technical power exhibited in these papers, combining analysis, algebra, and 
geometry, is truly unbelievable. Other approaches have been developed to 
reach the discrete series, but none has the directness and simplicity of his. 

It is hard to tell from his writings precisely when he started to think in 
a global way about representations and their arithmetic aspects. I am sure 
it already started in the early 1950s when his close friend Mautner began 
working with representations of SL(2. Q,,). Mautner came often to Princeton 
to talk with him, and it is almost certain that their discussions touched on 
the p-adic groups. Parallel to these impulses, his interest in number theory 
probably goes back to the late 1940s when he had attended lectures of Artin on 
classfield theory, stimulated later by his contacts with Chevalley. He himself 
had lectured in Columbia on it and also on Siegel’s theory of quadratic forms. 
Borel told me that sometime in the early 1960s Harish excitedly phoned him 
from a vacation outing about some striking things he had discovered about 
representations of p-adic groups. By the time of the Williamstown conference 
in 1972 he had already extended to the p-adic groups the principle he had 
discovered for real groups that relates the asymptotics of matrix elements to 
the Plancherel measure. In these lectures he formulated his attitude to the 
p-adic theory. He called this approach the Lefschetz Principle, which insisted 
that all the primes be treated on an equal footing. For him this meant that 
(roughly speaking) whatever is true for real groups should be also true for the 
p-adic groups. His entire work on the p-adic groups is animated by this view 
[15a, 15b]. 

With declining health he became less and less interested in publishing 
his results and simply used the format of announcements. For a man of his 
achievements the honors accorded to him were few and far between. This 
is not surprising since he neither promoted himself nor allowed others to do 
it for him. Occasionally there were suggestions that his methods were very 
elementary, perhaps implying that the complexity and length of his proofs 
arose from this. In my opinion such comments are widely off the mark. He 
had a vision of the entire theory of harmonic analysis at an extraordinarily 
early stage and spent the rest of his life bringing down that dream to reality. 
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He was a man ina great hurry and he got to where he wanted to be by whatever 
means he knew. It was Andre Weil who said it best when he said that “of all 
people he knew, there were only two for whom there was no such thing as a 
technical difficulty, namely, Chevalley and Harish-Chandra.” 

I remember one occasion in 1968 very vividly. He, who was a creature of 
regular habits, was suddenly forced to eat lunch at the Institute one day, and 
a bunch of us accompanied him to the Institute cafeteria which was housed 
at the top floor of the building then. He shared the table with us and the 
discussion naturally turned to one of his favorite topics—how an artist gives 
shape to his dreams. He said that in his experience one has a dream that is 
perfect and crystal-clear but in the process of bringing it down to earth the 
dream gets mangled so much that at the end one does not know whether to feel 
relief or elation. I have pondered about this remark many times and marveled 
at his imagination, which allowed him to dare to dream those great dreams 
of his, and his power and persistence that made them a reality no matter how 
long it took. 

More than anyone I have come across, he had profound emotional ties with 
his mathematical quest, perhaps partially explaining his many heart attacks 
some of which came when he was at crucial stages in his work. Once in 
1968 I mentioned to him that some of the work can be simplified somewhat. 
He did not say anything and I must say I was puzzled a little as I felt this 
was uncharacteristic of him. But in his lecture a couple of days later he 
acknowledged his tardiness but blamed it on the fact that he was heart broken 
that my method bypassed a very beautiful theory of his on certain principal 
values on the Lie group and its Lie algebra. He then continued, beaming, that 
he has found a new use for it! 

In all my conversations with him he almost never commented on his 
feelings about his own work and so, when he did so, they stand out very 
clearly. One of these occasions was about his work on arithmetic subgroups 
of semisimple groups. He had wanted to prove that these have finite co- 
volume but he was stuck on generalizing Siegel’s ideas from the context of 
symplectic groups to the general case, for nearly a month. The main dif- 
ficulty lay in relating the fundamental domains of the group to those of a 
smaller subgroup. The key idea came to him while he was out for a walk 
near Columbia University. When he came home he shared his excitement 
with Lily. He told her that “the had just done some thing that would even 
have pleased Poincaré.” Lily told me that she remembered at the time being 
struck by his wording. This and the finiteness theorems of orbits gave him 
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great satisfaction as achievements in a line of thought that had been touched 
upon by Gauss, Lagrange, Poincaré and Siegel. Such exultations were very 
uncharacteristic of him. 

The real mystery of his creative process, or that of any great artist for that 
matter, is of course impossible to track down. Indeed there are some instances, 
some of which I know from personal experience with him, and others by 
what he himself told me, which are truly startling. The time I mentioned 
above was also the period when he had run aground in his quest for the 
discrete series, and his difficulties led to a very serious nervous breakdown. 
Treatments involving electrical shocks were considered but were finally not 
advised for him. He became a patient of Dr. Nathan Kline, a pioneer in the 
use of chemical medications for depression. Dr. Kline’s treatment worked for 
Harish over time. When he started working again, his mind was crystal clear, 
and he reached his goal by methods entirely different from those that he had 
been trying before his crisis. The announcement [16] in the Bulletin of the 
AMS describing his results is an extraordinary one, giving the blueprint for 
the great cycle of papers mentioned earlier [14]. A similar thing happened 
to him during 1968 when he had the first of his many heart attacks. He was 
stuck at a crucial point in his theory of p-adic orbital integrals before the 
heart attack, but after he came home from the hospital, he went around his 
difficulties by different methods and reached his goals almost effortlessly. I 
remember his telling me this when I visited him while he was convalescing 
at his home after returning from his hospital stay. 

In spite of his protestations he had a deep feeling for fundamental physics 
and would often express it in informal discussions. He used to compare the 
discrete spectrum of a compact arithmetically defined homogeneous space to 
the phenomenon of diffraction. His treatment of the continuous spectrum is 
a variation of scattering theory where past and present are replaced by the 
various chambers defined by the Wey! group with the intertwining operators 
playing the role of the S-matrices. The Maass—Selberg relations then express 
the unitarity of the scattering matrices. Nevertheless, perhaps even because 
of his early experience in physics, he would always express his conviction 
that fundamental work in physics required a certain intuition that even the 
most creative mathematicians did not possess. “You may have a Porsche but 
unless you are careful you may drive it into a ditch” was the way he put it to 
me once. 

He never considered the problem of determining the unitary duals as central 
to his vision. However he was puzzled by the existence of the exceptional 
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series which played no role in harmonic analysis on the group and framed his 
puzzlement by asking “why did God create the exceptional series?” He had 
discovered that the representations that occur in the regular representation 
are precisely the tempered ones and in his later years he had come to the 
conclusion that the ones occurring in the spectrum of a homogeneous space 
were precisely the ones that were tempered with respect to that homogeneous 
space. He did not have time to develop this in sufficient detail. 

There were very few instances in his writings that were truly introspective. 
One was the talk he gave in the ICM at Moscow in 1966 [17]: 


... Although the case of real groups is beginning to be fairly well understood. 
our knowledge of the p-adic groups is still very rudimentary. Nevertheless there 
appears to be a deep-seated analogy between these two cases. In my opinion, one 
of the major tasks confronting us. 1s to try to discover and comprehend the reasons 
for this similarity. Once local Fourier analysis is wel] understood. one would have 
to globalize the problem by going over to the adele group. It is this global setting, 
which seems to provide the right framework for the understanding of the work 
of Hecke and Siegel, that the deeper connections between Fourier analysis and 
arithmetic are likely to emerge. ... 

The algebra 3 of bi -invariant differential operators plays a very important role in 
the real case. However, as we have already observed. there does not seem to exist 
any p-adic analogue for 3. Nevertheless it appears likely that all the final results 
of Fourier analysis continue to hold, after some slight reformulation. for the p-adic 
groups. The unravelling of this mystery, would, in my opinion, be an important 
achievement. 


On the occasion of the 80" birthday of Dirac he had this to say [18]. 


... | have often pondered over the roles of knowledge or experience on the one 
hand and imagination or intuition on the other, in the process of discovery. | 
believe that there is a fundamental conflict between the two, and knowledge. by 
advocating caution, tends to inhibit the flight of imagination. Therefore, a certain 
naivete, unburdened by conventional wisdom, can sometimes be a positive asset. 


Harish-Chandra understood that semisimple groups were objects of com- 
plete perfection in an imperfect world, perhaps like the music of Mozart. One 
needed methods of great power as well as supreme refinement to penetrate 
their secrets. These he possessed in great abundance, and they powered his 
life-long quest, through declining health in his last years. He discovered the 
fundamental concepts and principles: characters and their regularity, tem- 
peredness and the Schwartz space, Maass-Selberg relations, and so on. He 
formulated the Lefschetz principle and tried to continue the work of Siegel to 
all semisimple groups. With his power he combined a feeling for what was 
truly beautiful and an awareness for the infinite delicacy of the fundamental 
results. Taken as a whole, his work, a colossal structure informed throughout 
by his vision, is unsurpassed in twentieth century mathematics. 
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I would like to complete this essay with a very brief sketch of some of the 
contours of his work. I cannot think of a better way than to begin with 
some insightful, even moving, comments on his work from some friends and 
colleagues who had known him intimately and for a long time [19]. 


(Borel)... | heard several times André Weil say that he knew only two mathemati- 
cians for whom technical difficulties simply did not exist, namely, Chevalley and 
Harish-Chandra. However this was technical power at the service of the highest 
goals. | would be hard put to give another example of a work by a single au- 
thor extending over 2000 pages at such a consistently high level. or should I say 
relentlessly high level, devoted to one main goal, broadly conceived. . . 

... Speaking of exceptional technical powers at the service of high goals brings 
to mind C. L. Siegel, whose work Harish-Chandra held in great esteem. Once, 
in a conversation, A. Weil and I realized that, for a number of years. we had both 
viewed Harish-Chandra as a kind of spiritual heir to Siegel by his mathematical 
style, power and concentration on very difficult, basic, questions. .. 

... He often said that semisimple Lie groups are so perfect that they must have a 
divine origin... 


In mathematics. Harish’s life was indeed a search for fundamental general 
theorems, with the beliet that they should be beautiful, and combine to harmonious 
theories. He pursued this quest with awesome single-mindedness, persistency, 
power, and success. 


(Helgason) ... His magnum opus within representation theory, appropriately 
called “Harmonic Analysis on Semisimple Lie Groups,” stretches from 1951 (when 
he considered the first special case) to 1976 when the general Plancherel formula 
was completed. The cumulative nature of this work of twenty-five years leads one 
to characterize it as monumental; yet even this does not do justice to the courage 
and pioneering efforts, conceptual and technical, which were needed to overcome 
the formidable obstacles along the way... . 


(Langlands) ... His greatest achievement, the construction of the discrete series, 
still lay before him, and I have always reckoned it my great good fortune to have 
met him before 1964, when he proved their existence, and to have been close to 
him at that time. The discrete series are the keystone of his own theory of harmonic 
analysis and, it seems to me, to all later developments in representation theory. . . 


... It is difficult to communicate the grandeur of Harish-Chandra’s achievements. 
The theory he created still stands—if I may be excused a clumsy simile—like a 
Gothic cathedral. heavily buttressed below, but, in spite of its great height, light 
and soaring in its upper reaches, coming as close to heaven as mathematics can. . . 


(Mostow) ... Each great work in mathematics as in other forms of creativity, has 
its own uniqueness. But I believe it is accurate to say that no mathematician of our 
time has successfully completed so long and so arduous a climb, solo. 


His interest in infinite dimensional representations of semisimple Lie alge- 
bras was generated by a suggestion from Dirac, his supervisor in Cambridge, 
that he take a look at the Lorentz group. He proceeded at first by the in- 
finitesimal method, as was natural for a physicist. He must have realized that 
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his methods had an immense range of applicability. Starting from around 
1951 he spent the next thirty years and more in building a theory of harmonic 
analysis on semisimple groups by methods which never used classification 
and were so beautiful because they were so general. 

Infinite dimensional representations of semisimple Lie algebras had al- 
ready entered the picture in a decisive and dramatic manner in the question 
of constructing by general methods the semisimple Lie algebra correspond- 
ing to a Dynkin diagram and the closely related problem of constructing the 
irreducible finite dimensional representation with a given dominant integral 
highest weight. The methods that he developed were the ones that would start 
the theory of Kac-Moody algebras some years later. Some of his results were 
also obtained independently by Chevalley, in particular the relations, beyond 
the commutation rules determined by the Dynkin diagram, nowadays called 
the Serre relations. This theory should be appropriately called the representa- 
tion theory of the pair (g, 5) where g is acomplex semisimple Lie algebra and 
h is a Cartan subalgebra. The representations concerned are highest weight 
representations. In general they are infinite dimensional and not related to 
any representation of the group, but in one important case, namely that of a 
hermitian symmetric space, some of the highest weight representations are 
also group representations. I shall come to this a little later. 

The representation theory relevant to semisimple groups is that of a pair 
(g, €) where g is a real semimple Lie algebra and & is a maximal compact 
subalgebra. For these Harish-Chandra introduced the category of represen- 
tations consisting of those for which the restriction to § decomposes as the 
sum of irreducible finite dimensional representations of €. Let us call these 
g-modules €-finite. By using the classical theory of invariants he proved 
the completely unexpected result that in any finitely generated module of a 
semisimple Lie algebra g which is €-finite, the isotypical subspaces are finite 
modules for the center 3 of the enveloping algebra of g. He then created his 
theory of analytic vectors which allowed him to lift his Lie algebra theory to 
a theory of representations of a real semisiple Lie group G in a Banach space. 
This would lead to the finiteness of the K -multiplicities of a UIR of G (with 
finite center) when restricted to its maximal compact subgroup K, the type I 
property of G, and the existence of characters for irreducible Hilbert space 
representations with an infinitesimal character, in particular for UIRs. The 
existence of the character is a consequence of the following result: there is 
an integer N > | such that for any irreducible unitary representation 2 of G 
and any irreducible representation 0 of K, the multiplicity [7 : 0] with which 
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0 occurs in 7 satisfies the estimate 
[7 : 0) < Ndim(0). 


It follows from this that for any test function f € C™(G) the operator 


m(f):= / f (x)m(x)dx 
G 


1s Of trace class and 


O7(f) := Tr(a(f)) 


is a distribution in the sense of Schwartz, satisfying an estimate 


lOr(AI < Now Fl 


where uw is an element of the enveloping algebra and || - ||, is the L'-norm. 
The character is a distribution which is invariant under inner automorphisms 
(hereafter called simply an invariant distribution) which determines the rep- 
resentation up to equivalence. If x, is the infinitesimal character of 2, the 
character is an invariant distribution for the center 3 of the enveloping algebra 
with eigenvalue x,,, namely, 


0(Z)On = Xn(Z)On (z € 3). 


The character is in this sense an invariant eigendistribution. 

It was by the infinitesimal method that he obtained his famous result that 
any irreducible €-finite representation of the group G is a subquotient of a 
principal series representation, as also the result that any irreducible repre- 
sentation of the Lie algebra which 1s infinitesimally unitary is the K -finite 
part of a UIR of the group. He never came back in his later years to the 
infinitesimal method; but later, when it was revived by ideas from Bernstein- 
Gel’ fand-Gel’fand, Enright, Varadarajan, Wallach, Zuckerman, and others, 
he felt very gratified. He told me that he felt like a gardener who planted a 
seed and was fortunate enough to see it grow into a mighty tree. 

From then on his emphasis shifted dramatically, and Fourier analysis came 
to the foreground, with characters, matrix elements, and their differential 
equations, beginning to play a central role. This is where he left behind most 
of his contemporaries. No one understood as well as he did that as the groups 
are noncompact, one has to use differential rather than integral operators to 
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penetrate deeper. He had determined the Plancherel formula for all complex 
semisimple Lie groups exploiting brilliantly the fact that they have only one 
conjugacy class of Cartan subgroups [20a]. The point is that the singular 
elements of a Cartan subgroup form a variety of complex codimension | 
and hence of real codimension 2, and so the orbital integrals, smooth on the 
regular set, extend smoothly to the whole Cartan subgroup. Hence one can 
start with the limit formula whose significance had already been underscored 
by Gel’fand and Naimark, and take its Fourier transform, which gives the 
Plancherel formula. On the other hand, if the group is real and we fix a 
Cartan subgroup A, the orbital integrals are smooth on the regular set but in 
general have jumps across the singular points. Let 


I;(a) = i f (xax7')dgjax F(a) = Ag(a)I¢(a) (ae A’) 
G/A 


where A’ is the regular set of A and f is asmooth test function on G. Here A, 
is the Weyl denominator defined with respect to a positive system of roots. 
Let 0(w,) be the differential operator corresponding to the same positive 
system. Then F’, and all its derivatives have only jump discontinuities across 
singular points of A. Moreover 


Ve Ole eA. 


is independent of the positive system. Remarkably, V4(//) extends continu- 
ously to the whole Cartan subgroup and for some constant c independent of 
f we have 

cf (1) = Vay) (Ll) = (0(@ 4) Fy) (A). 


This is the limit formula. However c is 0 unless the Cartan subgroup is 
fundamental, i.e., has a compact part of maximal dimension, in particular 
if the Cartan subgroup is compact. Because of the jumps of Fy one cannot 
immediately take the Fourier transform of the limit formula. I shall explain 
later how he overcame this obstacle by working over the Schwartz space of 
the group. 

He worked out the Plancherel formula for SL(2, R) in a beautiful little 
note in the Proceedings of the National Academy [20b]. It is clear from his 
method that character values were going to play a fundamental role in the 
general theory and that the discrete series was the key to everything [21]. The 
fact that the various series of representations corresponded to the characters 
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of the various Cartan subgroups must have made it clear in his own mind 
that discrete series arose only when there was a compact Cartan subgroup. 
Although all these connections can only be dimly perceived by a reader, they 
must have resonated within himself with great clarity as one can judge from 
the force and single-mindedness with which he pursued these ideas. 

The construction of the discrete series is arguably among his greatest 
achievements. The discrete series form the key to the construction of the 
irreducible representations of a semisimple group that occur in the regular 
representation, and hence for the problem of obtaining an explicit description 
of the Plancherel measure. 

For a unimodular Icsc group G and a UIR x of it, we say that m is in the 
discrete series if the matrix elements 


fo 1X > (1(x)¢, Vv) 


are in L°(G). We also say that 7 is square integrable. It is actually enough if 
just one fg y iS square integrable for some pair @ 4 0, wy 4 0. Let w be the 
equivalence class of 7. One then has the orthogonality relations in L?(G) 
given by 

(fou Sow) = FOG ¢ )(w', ) 
where d(w) > 0 is a constant depending only on w and is called the formal 
degree of w. In particular, if @, y are unit vectors, 


ae 
[amon dx = FTES); 


If G is compact and we normalize dx by requiring that fe d x= lpihend (a) 
is the actual degree of the representation. In the general case these w are 
precisely the ones that are direct summands of the regular representation and 
d(q) is the mass of w for the Plancherel measure. In all cases known (real 
and p-adic semisimple groups) the Haar measure can be normalized so that 
all the d(w) are integers, but I do not know how far this can be pushed for 
general unimodular Icsc groups. 

In the general case perhaps more can be said about how far the dis- 
crete series go towards getting the UIRs—a question that is certainly worth 
exploring—but for semisimple groups the importance of the discrete series 
arises from the fact that even if G does not have a discrete series, one can 
construct UIRs by inducing from parabolic subgroups which are semi direct 
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products of the form P = MU where M is reductive with a discrete series 
and U is unipotent, using discrete series representation of M. Thus getting 
the discrete series became the over-arching concern for Harish-Chandra. 

Already for groups G for which G/K has a G-invariant complex structure 
and rank(G) = rank(K ) (so that a maximal torus of K is a compact Cartan 
subgroup of G), a case that includes SL(2, R) where G/K is the Poincaré 
upper half plane, he had constructed highest weight modules for the Lie 
algebra which integrate to representations of the discrete series, calculated 
their characters on the compact Cartan subgroup, and their formal degrees. 
The formulae for the character and formal degree were exactly analogous 
to the celebrated Wey] formulae [22]. However this method gives nothing 
when G/K is not hermitian, and even in the hermitian case, only a part of the 
discrete series could be so obtained, namely, not all characters of the compact 
Cartan subgroup would correspond to these highestw eight representations. 
This example must have convinced him that the road to the discrete series 
lay through the explicit construction of their characters, thus bringing his 
method in complete harmony with that of Weyl. But characters of infinite 
dimensional representations exist initially only as distributions, and it was 
not immediately clear how to speak of their values. At some point he started 
to believe that the characters were in fact functions. 

Although there is more to a character than the fact that it is an invariant 
eigendistribution, he had the great insight that so far as its local structure is 
concerned, only the fact that it is an invariant eigendistribution is critical. It 
took him nearly a decade to overcome all the technical difficulties and prove 
that all invariant eigendistributions are functions. More precisely, they are 
analytic functions on the regular set, these functions are locally integrable 
on the entire group, and the eigendistributions are precisely the distributions 
defined by these functions. Let © be an invariant eigendistribution. Let A 
be a Cartan subgroup, ©, the restriction of the character to the regular set 
of A, and Ay. a4. V4 be defined as before. Then the differential equations 
satisfied by © around the singular points imply among other things that 


Py = AnO, 


extends analytically to the open set A’(R) where no real root is equal to I. 
In particular, if A = B is a compact Cartan subgroup, then ®» is anaytic on 
all of B and is a linear combination of the Wey! transforms of a character of 
B. The analogy with the Weyl character formula is clear. 
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For any action of G ona space X one can formulate the notion of invariant 
eigendistributions and it is not always the case that they are functions. There 
are two special actions, the one on the group itself by inner automorphisms, 
and the corresponding one on the Lie algebra. On the Lie algebra one has 
the algebra of constant coefficient invariant differential operators playing the 
role of 3. For both of these actions the regularity theorem is true. But what 
is decisive and absolutely astounding is that the invariant eigendistributions 
on the group could be obtained (after adjusting for the jacobian of the expo- 
nential map) by exponentiation of the invariant eigendistributions on the Lie 
algebra. The tempered invariant eigendistributions on the Lie algebra can 
be constructed by Fourier analysis as the Fourier transforms of the invariant 
measures On the orbits, and their exponentiation gives the discrete series on 
the group. 

In the general picture the various Cartan subgroups of G give rise to the 
various series of representations parametrized by the dual groups of the Cartan 
subgroups, and so the discrete series would come from the characters of a 
compact Cartan subgroup. The formal dimension formula obtained in the 
calculations with the hermitian symmetric case pointed to the fact that the 
characters of the discrete series would correspond to the regular characters 
of the compact Cartan subgroup, namely those which are in general position 
with respect to the Weyl group. The condition of temperedness on the Lie 
algebra would translate into the condition that the character values be bounded 
globally by |Dg|~'/? where Dg is the discriminant of the group. 

The fundamental theorem of the characters of the discrete series can be 
stated now. For simplicity | assume that G is the real form of a complex 
simply connected semisimple Lie group G,. with a maximal compact U such 
that UG = K isamaximal compact of G. It is assumed that G and K have 
the same rank. Let B be a maximal torus of K which is then also a maximal 
torus of U. Then the Weyl group W,. of G,. operates on B and so on B; a 
character € € B is called regular if its stabilizer in W.. is trivial. We write 
W, for the subgroup of W, that comes from G. Select a positive system of 
roots for (G. B) and let A = Ax be the corresponding Wey] denominator. If 
be=sie(Biythen B ~ L where L is a lattice in enill)D" we write log € 
for the element in L corresponding to the character € € B. If H, € b, 1s the 
element dual to the root a then (log €)(,) is real. Let 


w=|[H, e€) =sign a) = | | dogé)(Ha). 


a>0 a>0 
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Let 
q = (1/2) dim(G/K). 


Finally, let €:(G) be the set of equivalence classes of the discrete series 
representations of G. 

To specify an eigendistribution on the group it is necessary to describe it 
on each one of a complete set of mutually nonconjugate Cartan subgroups. 
Harish-Chandra discovered a surprising variant of this procedure for describ- 
ing the eigendistributions which would later prove to be the characters of the 
discrete series. He specified them on a compact Catan subgroup and imposed 
a bound at infinity on the other Cartan subgroups. 

The first part of the construction of the discrete series is then the following 
theorem which specifies the invariant eigendistributions: for each regular 
Ee B there is a unique invariant eigendistribution @; on G such that 


(i) On B’ := BMG’ we have 


d Ss 
Oy m (-1)fe(g) Ee 
B 


(ii) supg |Dg|!/|@e| < 00. 


In (ii) Dg is the discriminant function on G; its restriction to any Cartan 
subgroup A is +A, where A, is a Weyl denominator for A. Thus (ii) is the 
same as saying that on each Cartan subgroup A we have 


|Aa(@aJOz(a)i}<C (ae ANG’) 


for some constant C, G’ being the set of regular points of G, namely the 
points where Dg does not vanish. In practical terms A,4(a)©,i(a) is a sum 
of characters of A (locally) and (ii) says that only the characters which are 
bounded enter. This is the decisive condition that allows a continuation of the 
character from the compact Cartan subgroup to the other Cartan subgroups. 
The reader should also note that the sum in (i) is only over WG; if G itself is 
compact, then WG = W, and (i) is the Weyl character formula. 

The second part of the construction is the following theorem: for each 
regular character € € B, there is a class w(&) € &2(G) such that Ox is 
the character of w(&); the map € +> w(€) is surjective onto €)(G), and 
w(&|) = w(&2) if and only if there is an element s € Wg such that E> = &). 
For the formal degree we have 
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d(w(&)) = w(€) = | [(ogé)(H,). 


a>) 


The analogy with the Weyl formula cannot be any closer. It is also true, 
although its significance becomes clear only when we go to the continuous 
spectrum, that each irreducible of K occurs only in finitely many discrete 
series representations, and the trivial representation of K never occurs. The 
Harish-Chandra character and formulae are the most beautiful formulae in 
infinite dimensional representation theory. 

To go from the eigendistributions to the discrete series representations is 
a long path. The essential point is to show that the Fourier components of 
the ©z (with respect to right translations by K ) are in L?(G). These Fourier 
coefficients are K -finite eigenfunctions, and, because of their K -finiteness, 
satisfy certain additional elliptic equations, and so are analytic functions on 
G. To prove that they are square integrable on the group requires entirely 
new ideas on the asymptotics of eigenfunctions. 

In his earlier work on spherical functions Harish-Chandra had discovered 
the method to determine the asymptotic behavior of the spherical eigenfunc- 
tions. He now applied the same principles in this new context. However the 
problem is complicated by the fact that the differential equations now admit 
solutions that grow too fast at infinity and have nothing to do with harmonic 
analysis on the group. For example, the matrix elements of both the finite 
dimensional and discrete series representations satisfy the same differential 
equations! To single out the relevant solutions Harish-Chandra introduced 
the fundamental inequality, the famous weak inequality, that the solutions 
should satisfy. 

The idea is very simple. We imitate the situation in the spherical case but 
to have full generality we work with matrix spherical functions or functions 
f with values in a finite dimensional Hilbert space on which K has a unitary 
action t from both left and right, satisfying the sphericality condition 


f (Kixka) = ky-f (x)-kg = t(ki) f(x) t (ke) 


and the usual differential equations. f is said to be t-spherical. If V is 
one-dimensional and T is the trivial representation we get the usual spherical 
functions. We now note that the jacobian in the polar coordinates correspond- 
ing to the decomposition G = KCI(A*)K is of the order of e?? where 2p is 
the sum of positive roots. In the actual spherical case the eigenfunctions i 
are just on the verge of being square integrable, namely that e? f has at most 
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polynomial growth. This condition is taken over to the general case: 
|f(a)| < Ce POF (1 + || logal|)” (C2020) 


A t-spherical eigenfunction with this property is said to satisfy the weak 
inequality. 

To see how f behaves at infinity on G, let G = KAN be an Iwasawa 
decomposition and A* the positive chamber in A defined by NV; AY =vexpia® 
where a = Lie(A) and a* is the positive chamber where the roots belonging 
to N are > 0. Then it is enough, in view of the Cartan decomposition 


G = KCI(A*)K, 


(CI means closure)to examine what happens at infinity on A*. Now there are 
several ways of going to infinity on A*; indeed, if F is a subset of simple 
roots, then one can insist that only the values of the roots from F go to infinity, 
while the remaining simple roots stay bounded. The differential equations 
satisfied by f then have the remarkable property that when we neglect all 
terms that vanish at infinity (for this choice of F), they become the spherical 
differential equations on the reductive group M (F ) defined by the simple roots 
outside F. In this way the original eigenfunction f is approximated (with a 
rapidly decreasing error) very well by linear combinations of eigenfunctions 
on M(F). Harish-Chandra calls this the constant term fr of f of along 
F. It is essential for this approximation that the eigenfunction f satisfy the 
weak inequality. For these the approximation by the fr gives a satisfactory 
theory. This method applied to families of eigenfunctions coming from the 
various series of representations is the basic tool in the study of the continuous 
spectrum. 

To return to the the Fourier coefficients of the ©;, it is now a question 
of showing that they satisfy the weak inequality. This turns out to be a deep 
question, but once it is taken care of, the analysis sketched above would apply 
to them. For f to be in L*(G) the condition is that fr = 0 forall F + Q, or, 
as Harish-Chandra calls it, f is a cusp form.. If the eigenvalue of f is defined 
by a regular character, a condition satisfied by the Fourier coefficients of Oz, 
one can show that fr = 0 for all F 4 @ so that f isa cusp form. Those 
who are familiar with some structure theory will recognize that the F define 
parabolic subgroups (cusps) and that the approximation of f is the constant 
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term of the expansion of f around that cusp. There is thus a clear analogy 
with the theory of automorphic forms. 

It remains to show that the entire discrete series has been obtained. For this 
one can proceed as Wey] did in the compact case by reducing the analysis to B 
by integrating over the orbits. However there are other Cartan subgroups and 
their presence implies that the normalized orbital integrals are not smooth 
but have jumps. Only when f has the property that its orbital integrals 
over the noncompact Cartan subgroups are 0 can one come down to B. Such 
functions are actually cusp forms and so one cannot operate inside C°’(G) but 
in the Schwartz space C(G) introduced by Harish-Chandra. Since elements 
of the Schwartz space are nor in general in L'(G), their integrability on the 
conjugacy classes is no longer a given, but has to be proved. Once this is 
done, the completeness proof can be carried out. 

For the complete story of Fourier analysis on semisimple groups that in- 
cludes the continuous spectrum, there still remains the principle, which he 
first uncovered in his work on spherical functions, that the coefficients of the 
asymptotic expansions of the eigenfunctions of the continuous spectrum de- 
termine the Plancherel measure in a rather simple manner. For eigenfunction 
expansions on a half-line this goes back to Wey! [23]. For the special case of 
the spherical continuous spectrum see [24], where the principle is described 
as “reminiscent of a result of H. Weyl on ordinary differential equations.” 
The analogy becomes very close when G/K has rank | and is in fact the 
beginning of the analogy with scattering theory alluded to earlier. 

The spherical eigenfunctions are given by the integral 


g(r, x) =| eA—p Hk) ap 
Ks 


where H is the function on G defined by 
PE hexpel (Nn (xeG, keK, H(x)EeaneN). 


So we have a family of eigenfunctions parametrized by linear functions A 
on a = Lie(A) where G = KAN is an Iwasawa decomposition and 29 is 
the sum of the positive roots (restricted) (I use the notation of [24]). By the 
method of perturbation analysis of the differential equations at infinity on G 
discussed briefly earlier one can show that at infinity on At when all roots 
go to infinity, we have 
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gd, ay~ as c(sajeoMles a) 


s 


where the sum is over the Weyl group and A is pure imaginary (unitarity con- 
dition). The function c is meromorphic but never 0 when A is pure imaginary, 


Je(sA)| = [e(A)| 
for all s and all pure imaginary A, and the Plancherel measure is 
Jo(A)|7da. 


Notice that because of the approximation above, if one form a wave packet 
Ou) = i uUu(A)p(A, x)dd 
(-1)!/2a4 


then we can, at least to a first approximation, approximate the wave packet 


on At by 
 f etsrrume™™raa 


and begin to use classical Fourier analysis on A. 

In the general case Harish-Chandra uses vector-valued functions with val- 
ues ina double module for K and parabolic subgroups other than the minimal 
one for defining integrals analogous to the above one. The spherical integral 
is clearly analogous to the Eisenstein series one encounters in the theory of 
automorphic forms, see [25] where he clearly explains this analogy in the 
introduction; indeed, this is one of the few places in his entire work where he 
has made a genuine effort to lay bare the intuition and analogies behind his 
approach. For this reason he called the integrals generalizing the spherical 
ones the Eisenstein integrals. The asymptotics of the Eisenstein integrals and 
the precise formulation of their relation to the Plancherel measure, not only 
in the real case but in the p-adic case as well, are his greatest achievements 
in the realm of the continuous spectrum. Even a brief discussion of this is 
beyond the framework of this essay. 

! am aware that this sketch hardly does any justice to the majestic devel- 
opment in the cycle of papers mentioned earlier [14]. For details I ask the 
reader to go to my introduction to his collected papers or to [26a, 26b]. In ad- 
dition, for a good reference to the evolution of Harish-Chandra’s ideas in the 
nearly thirty years since he died, see the articles in the legacy volume cited in 
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[9]. For the work of Van den Ban, Delorme, Flensted-Jensen, Schlichtkrull, 


and others on the harmonic analysis of semisimple symmetric spaces, see 
{27a, 27b]. 
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V.S. Varadarajan 
Reflections on Quanta, Symmetries, and Supersymmetries 


Unitary representation theory has great intrinsic beauty which enters other parts of 
mathematics at a very deep level. In quantum physics it is the preferred language for 
describing symmetries and supersymmetries. Two of the greatest figures in its history 
are Mackey and Harish-Chandra. Their work (to use the words of Weyl) affords shade 
to large parts of present day mathematics and high energy physics. It is to their memory 
that this volume is lovingly dedicated. 


The essays in this volume are like a stroll through a garden of ideas of this rich subject: 
quantum algebras, super geometry, unitary supersymmetries, differential equations, 
nonarchimedean physics, are a few of the topics encountered along the way. The 
author, whose mathematical education evolved out of his interactions with Mackey and 
Harish-Chandra, concludes this volume with brief portraits of their work, embedded 
in the context of personal reminiscences. 


Mathematics 


> springer.com 


